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PART  I  -  AN  OVERVIEW 


A.  OHJH.TIVLI.,  ‘oCOf’L,  AND  ViLTHOlK)LOGY 

Ihir  for  rho  ndt i ori-w i lie  stuJy 

oi  ,i!|  fivii  i I i  nd f  i >,)n  f)f  In- 

t'.Mri.ilion)  r.inio  from  our  doairo  Ir'^  iedrn 
till.!.'  oboiit  ffn)  options  .tv.iilabio  for  the 
.  i;.ji  I  i  LO  t  i  on  o!  fMo  Jill  concept  prior  to 
out  own  (i(!cisions  .ibout  its  possible  use 
in  Ait  I  or ce  i osodf n  h  and  dovoiopment 
a  fivilios.  lo  It) is  end  we  soucjht  data 
v>n  usei  popul  it  ioi!'>,  lilerature  coveratje, 
im'thods  of  *'■, t ab  1  i slii nfj  and  ma i r: tai ni nrj 
profiles,  as  well  iniornuiti  n  on  the 
•iupporting  otiuipmiVi!  and  special  tech- 
ni'iaos. 

The  linditxjs  and  conclusions  of  this 
rt  porf  have  been  dr.iwn  trom  the  analysis 
of  operational  char.ic  fori  sties  in  thlrty- 
I'i'll't  SOI  services  of  the  United 

bt.ifos,  and  three  ('’■)  SD!  services  of  the 
bi  i  tod  Kingdom.  Itir.  represented  a  tnajor 
portion  i»t  ••Airin>  t  i  I  ty  (bU)  systems  known 
to  he  in  operation  I'l  of  August  I9(»7,  and 
lii.ue! ore  tan  lie  t.iKtin  as  a  fair  sample 
of  I  t)»f  total  popu  I  a  1 1  on , 

Our  principal  data  collection  and 
vt  i  fieri  Ion  tools  consisted  of  structur¬ 
ed  .|ue>,  t  iontiairps  (shown  in  Part  III), 

*  s  ute  infer  views,  and  prhlished  1 1  rera- 

t  .» e  on  •.ptnillc  ’«)!  ••.ystoms  (Part  III). 
i>>  Innure  aci  uracy  oi  data  wo  asked  all 

•  I  oui  r e),put.dnn1s  to  review  and  conmjnt 
-  i!  our  Hrsf  ifralf  report.  Their  ro- 

.i,-.j>s  and  .  f  itigue  have  been  incor- 
joi  lied  in  this  r<*port. 

SHI  ‘..uvia  DtriNLO 

Ii.i>  lir.t  difltrulty  we  encountered 
ii.  ttic<  lOur  .«  vd  this  study  was  defining 


the  concept  of  SDI.  The  original  con¬ 
cept  was  proposed,  elaborated  and  demon¬ 
strated  oy  the  late  H.  P.  Luhn  in  1958. 
Luhn's  notion  was  that  computers  were 
ideally  suited  for  screening  large  num- 
hors  of  indexed  documents  and  matching 
them  with  terms  which  describe  the  scien¬ 
tist’s  profile.  Such  screening  would  be 
automatic  and  periodic;  the  scientists 
would  receive  "tailor-made"  announcements 
about  items  of  individual  interest  no 
matter  where  published  and  thereby  manage 
to  keep  up  with  the  literature  despite 
the  proliferation  of  Journals  and  techni¬ 
cal  reports  and  at  the  same  time  save 
time  and  effort. 


The  idea  of  selective  dissemination 
itself  was  not  new,  however,  since  It 

had  already  bean  widely  practiced  by 

many  librarians,  who  could  now  call  their 
service  non-computer  I  zed  SDI.  Thus 
eventually  the  SOI  concept  came  to  em¬ 
brace  many  non-eutomated  systems  which 
furnished  the  user  with  notices  about  new 


literature.  It  was  the  selectivity  that 

Attracted  »r  attention.  We  therefore 

defined  SOI  as  follows: 


SDI  is  a  document-alerting  service 
which  selectively  notifies  users 
about  new,  or  newly  access lonod, 

I i tereture.  The  selectivity  Is 
provided  by  matching  subject  areas 
of  each  document  with  user  "profiles" 
which  uniquely  define  users’  areas 
of  Interest.  Such  a  service  has 
firmly  established  rules  of  dociMient 
selection,  profile  matching,  and 
user  notification.  The  users 
serviced  by  an  SOI  service  may  be 
individuals  or  groups  of  individuals 
who  combine  their  interests  Into  a 
single  profile. 


C.  MAJOR  FI  NO  I  NOS  ANO  CONCLUSIONS 


I .  Divorsity  of  Ohjoctives,  Qutpuls, 

and  I’rocedures 

If  tiny  general  i nation  about  SOI  tan  be 
s.itely  supported,  it  is  that  the  surveyed 
sol  systems  vary  greatly  in  their  design 
and  operation.  While  meaningful  practices 
i)f  the  majority  can  often  be  found,  the 
exceptions  are  to«)  significant  to  be 
overlooked.  In  fact  these  exceptions 
often  point  out  very  useful  and  unique 
applications  of  5l)l  systems.  Most  of 
the  surveyed  systems  seem  to  be  unique 
in  at  least  one  area,  and  the  majority 
ut  systems  varied  from  parameter  to 
parameter.  Some  examples  of  the  diver¬ 
sity  among  systems  follow. 

The  existing  SUI  systems  serve  a 
great  variety  of  users  as  we  1 1  as  a 
(jroat  number  of  users:  scientists, 
engineers,  managers,  salesmen,  doctors, 
and  clients  with  any  information  need. 
Fully  operational  systems  serve  from 
1/  to  2B00  users.  The  "users”  are 
usually  individuals,  groups  artificially 
created  by  the  system,  research  labor¬ 
atories,  Individuals  forming  their  own 
group,  administrative  groups,  and  even 
ttimpanles.  Altogether  the  surveyed  SOI 
systems  process  anywhere  between  25  to 
SU,000  documents  per  month.  The  SOI 
systems  are  performed  as  often  as  daily 
and  as  seldom  as  once  every  2  months. 

Some  SOI  systems  are  automated  while 
others  are  semi -automated  or  nanual. 

While  some  SOI  services  depend  exclusively 
on  the  bibliograhic  tapes  produced  by 
major  U.S.  document  centers,  there  are 
just  as  many  who  handle  their  own  docu- 
mi>nt  accessioning  and  Indexing.  There 
in  a  great  variety  in  construction  of 
profiles.  While  some  offer  the  user  a 
choice  of  only  a  few  subject  categories, 
others  use  proflies  of  over  100  terms. 

Some  use  controlled  vocabularies  for  both 
profile  and  document  indexing,  while 
others  depend  on  free  language  indexing. 
M<<iny  profiles  ere  being  matched  against 
processed  literature  by  people  while 
many  others  use  elaborate  computerized 
matching  logic. 


All  this  diversity  demonstrates  that 
GDI  as  a  concept  is  easily  adaptable  to 
<3  great  variety  of  objectives  and  oper¬ 
ating  conditions.  Its  potential  appli¬ 
cations  have  hardly  been  exhausted.  For 
instance,  one  druq  company  uses  SDI  to 

keeo  its  doctors  and  salesmen  informed 

of  company  and  competitors'  products. 

The  Gui  concept  has  also  been  success- 
full/  applied  to  primary  distribu'^'ion  of 
laboratory  reports  and  is  likely  to  effect 
tne  tufuio  pattern  of  technical  Journal 
production  and  distribution. 

2.  Similarities  in  Development  Histories 

With  all  this  diversity  of  design 
there  is  a  remarkable  similarity  in  the 
origin  of  these  systems.  All  but  two 
have  been  established  as  the  subsystems 
of  other  documentation  services  seeking 
to  capitalize  as  much  as  possible  on  the 
existing  equipment  and  procedures.  More¬ 
over,  most  of  them  have  been  developed,  as 
the  services  to  the  R&D  community  in 
Government  and  Jisdustry.  Finally,  most 
of  them  began  as  "experimental"  systems. 

3.  The  Period  of  SDl  "Adolescence" 

Practically  all  operational  systems 
started  as  experiments  in  selective  dis¬ 
semination  so  as  to  be  better  able  to 
cope  with  the  uncertainties  of  the  new 
operational  mode.  We  also  suspect  that  in 
many  cases  the  term  "experimental"  was 
really  an  euphemism  for  managements' 
unwillingness  to  face  the  consequences  of 
an  unsuccessful  system.  Since  we  have  no 
data  on  the  death  rate  of  the  experimental 
systems,  we  are  unable  to  estimate  the 
chances  of  any  new  system  to  reach  the 
operational  status.  The  aval  I  able  data 
does  suggest,  however,  that  most  of  the 
systems  which  began  experimentally  are 
still  alive  -  mostly  because  they  were 
subsidized  oy  several  federal  agencies 
and  major  industrial  organizations.  There 
are,  of  course,  notable  exceptions: 
operational  systems  that  are  being 
supported  through  commerical  subscriptions 
like  the  ASCA  service  of  the  institute 
for  Scientific  Information  and  Scientific 
Documentation  Centre,  Ltd.,  in  the  U.K. 
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1..  hi 'isomi n.jt  ion  Aid  SDI 

d.  Library  Acquisitions  SDi 

e.  Group  Profile  SDI 

The  Peripheral  Vision  SDI  needs  1 1 1- 
tle  comment  since  It  Is  essentially  an 
SOI  service  in  which  the  whole  universe  of 
available  documents  is  being  scanned  and 
provided  to  the  user  according  to  the 
predetermined  hit  rules.  SDI  systems 
using  N/\SA  or  Chemical  Abstract  tapes  are 
prime  exponents  of  this  type  of  service. 

The  Focused  Vision  SDi  is  a  variant 
which  is  practiced  in  a  varying  degree 
where  the  literature  input  is  being  eval¬ 
uated  not  only  as  to  its  relevance  to  the 
genera!  area,  but  also  for  its  scientific 
or  technical  worth.  In  such  a  system  the 
user  is  relieved  of  the  need  to  decide 
which  of  the  papers  are  of  sufficient 
value  to  be  ordered  -  the  systems  oper- 
•iTors  act  as  his  filters  for  much  of  the 
material  which  otherwise  might  have  re¬ 
sulted  In  a  hit.  Because  of  this  pre¬ 
screening  the  user  usually  receives  only 
not  I ces  on  those  ! terns  wh I ch  are  be  1 1 eved 
to  be  most  relevant  to  his  needs  -  his 
attention  Is  being  "focused”  on  a  much 
narrower  segment  of  the  existing  I  Iter - 
•iture  which  may  lie  on  the  peripheries  of 
his  usual  literature  sources.  The  Bureau 
of  Reclamation  for  instance,  does  such  a 
morough  job  of  prescreening  documents 
that  only  S/.  of  received  documents  Is 
ontorec  into  Its  data  base. 

n;«.seml nation  Aid  SOI  has  been  ob- 
•unrved  u3  ar.  attempt  at  providing  a  more 
..vcuruTo  tool  for  the  dissemination  of 
tne  intornaify  produced  !iterature«  Nhen 
it  is  so  used,  the  sot  does  indeed  pro¬ 
vide  a  Kind  of  peripheral  vision  to  the 
user  since  it  can  be  easily  argued  that 
the  '.ompany-broduend  literature  usually 
lies  outside  of  his  regularly  scanned 
sooroas.  in  one  innovative  approach  an 
iicrospaco  company  is  using  the  SOI  method 
to  its  commercial  advantage  by  insuring 
th.1t  the  "right  kind"  of  technical  llter- 
.jturo  is  srnt  to  "appropriate"  Government 
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fMD  nvinnqers.  By  holdinq  up  before  fheso 
m.iriagor-i  the  relevant  company  work  it  is 
hoped  to  enhance  the  company’s  image  in  a 
given  technical  area  and,  consequently, 
increase  a  probability  of  future  R4D 
contracts. 

Library  Acquisitions  SDI .  This 
variant  was  found  In  only  one  of  the  sur¬ 
veyed  systems  but  it  is  a  most  interesting 
form  of  SDI.  Basically  it  serves  both  as 
a  screening  device  for  the  research  lab¬ 
oratory’s  librarv  aceuisitions  as  well  as 
a  method  of  notifying  tht*  liorary  users 
about  the  newly  received  documents.  This 
variant  empioys  profiles  of  Its  users  to 
determine  the  over-all  library  profile 
against  which  is  being  matched  bibliog¬ 
raphies  on  magnetic  rapes  of  major  U.S. 
documentation  centers.  Of  equai,  if  not 
greater  Interest,  Is  the  fact  that  this 
is  the  only  known  system  which  has  Justi¬ 
fied  Its  worth  by  contributing  to  the 
reduction  of  technical  library  acquisition 
costs.  By  avoiding  purchase  of  marglnal- 
Interest  literature,  the  cost  of  SOI  is 
being  rationalized  by  showing  the  evidence 
of  savings  which  are  possible  by  not  buy¬ 
ing  and  storing  ’’useless"  documents. 

Group  Profile  SOI.  SOI  was  original ly 
created  to  provide  service  to  individuals, 
by  kffieping  them  up  to  date  on  secondary 
fields  of  Interests  as  well  as  In  their 
own  occupational  fields.  The  individual 
Interest  profiles  tend  to  be  costly  how¬ 
ever,  and  require  a  lot  of  planning  and 
a  large  data  base.  Some  organizations 
unwilling  to  spend  the  time  end  money 
necessary  for  such  personalized  sen'ices 
have  moved  In  the  direction  of  group 
profiles  where  eech  profile  represents 
the  combined  Interests  of  a  single  pro¬ 
fessional  group.  The  groupings  can  follow 
several  distinct  lines:  groups  involved 
by  a  single  research  project,  groups 
having  common  academic  interests,  or 
groups  where  the  members  occupy  similar 
positions  In  the  avnpeny,  etc.  (ilKe 
selasmen  of  a  given  product  I  Inal. 

5.  Most  Services  are  Under-Used 

There  aro  two  significant  parameters 


in  measuring  the  use  of  a  system:  system 
capacity  (limits  determined  by  configura¬ 
tions  of  a  system)  and  the  number  of 
eligible  and  available  users  that  par¬ 
ticipate.  The  surprising  findings  of  this 
survey  are  that  all  systems  are  not  even 
close  to  their  capacity.  Moreover  only  a 
relatively  small  portion  {22%)  of  the 
eligible  users  depends  on  the  SOI  services. 
Despite  This  apparently  weak  showing,  ail 
systems  Surveyed  expected  to  continue 
Their  service.  It  should  also  be  noteo 
that  four  of  tne  big  complex  systems 
operate  with  between  80$  -  100$  of  their 
eligible  users. 

Why  the  many  services  are  not  more 
fully  utilized  cannot  be  answered  with  our 
data.  At  this  time  we  can  only  postulate 
that  the  inadequate  advertising  of  the 
services  is  one  of  the  contributory  causes 
for  the  minimum  use  of  the  SDI  services. 

6.  Optimum  Sizes  of  User  Population 

In  principle  SOI  concepts  and  method¬ 
ologies  cen  be  made  to  work  for  eny  number 
of  users.  The  wide  range  number  of  users 
found  by  this  survey  (IS  to  2800)  tends 
to  support  the  validity  of  this  contention. 

But  let's  be  more  specific  about  the 
optimum  number  of  SOI  users  per  service. 
Our  study  shms  that  experimental  and 
partially  operational  systems  usually 
work  with  a  small  number  of  users.  The 
average  found  In  this  survey  is  $2.  These 
users  supply  a  considerable  amount  of 
feedback  during  this  trial  period  until 
the  system  Is  ready  to  run.  Fully  opera¬ 
tional  in-house  systems  usually  operate 
with  400  or  fewer  users. 

The  reasons  that  systems  have  less 
than  4(X>  users  teem  to  come  from  other 
than  SOI  limitations.  Most  SDI  managers 
feel,  for  instance,  that  system  capacity 
was  not  being  approached  and  their  systems 
did  not  require  changes  to  accommodate  e 
marked  Increase  in  the  number  of  new 
users.  Reasons  that  are  more  plausible 
are  likely  to  be  machine  end  staff  time, 
priorities,  the  kind  of  subject  matter 
selected  for  document  inputs,  end  perhaps 
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v<  v  iiiilii  .ift'*,  lh<it  .1  -.III  pi  i'-iruily  '.nwill 
111  iisni*.  (If.-.  Ih.iii  /'»'/•  i>l 

I  lii|i|ilr  usi'i'.)  takes  advantaqe  of  the 
|!|  ■.y.li'mi.  two  •^y.lmns  indie. iled  Iha* 
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ii'..  I  ■.  wi-io  inilially  in  furor,  tod,  but 
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111  ii-.t'i ‘i  rapidly  i ni  re.r-ied  to  three- 
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7.  ‘'.oiirf'or  of 

It  is  possible  to  operate  an  SOI 
'.y.lom  without  having  an  in-house  library 
or  technical  Information  center.  About 
two- thirds  of  the  surveyed  systems  use 
other  sources  for  at  least  part  of  their 
.io>  iim(>nt  input.  Half  of  those  systems 
,uo  (lepondont  on  tho'.e  sources  for  lOOf 
of  their  document  input.  Two  options 
were  indicated  by  participants:  dis¬ 
semination  centers  that  usu.illy  supply 
m.i.|t'etic  tape,  and  -ontr.jctors  rhat  per- 
i.mti  index inq  .»nd  other  ttjsks  to  prepare 
m  I.  h  i  iM»-roady  record',  or  th.jt  perform 
the  entire  SUI  function, 

U'.inq  othor  sourcos  lor  document  in- 
pui  h.r,  sevor.il  siqnif  leant  advantages. 
>•1.1  tinq  and  prep.irinq  documents  for 
'■ill  is  the  biqqost  on-qoinq  task  in  this 
(.1 .1. 1  ••••.,  Ttie  time,  personnel,  and  cquip- 
r-»  III  •.  ivinqs  .)ro  immediately  obvious.  In 
.‘Hii.in  using  .i  dl'.s«’mln.ition  center's 
i>!  il  will  in  .ill  pf  otnihi  I  i  ty  put  some 
...I!  ir. lints  on  the  system  options  and 
lt..■^<•|.y  l  ilt  d.iwn  on  "xpensive  uxpnrS- 
»>•  .1  iilnn.  Hi  .-'■emin.it  Ion  centers  often 
!  •  ,,•!  Iw.iru  p.i«tK..  in.  .in.l  consul  ting 

.vi.i  ’.,  .if  Hioy  c.ti=  perform  the  entire 
.  tv!  <•  ii»-h.Mi'.e.  H-.Inq  a  contractor 
.  ihi*  s.rthi.  flmt',  personnel,  and 
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terms  -ind  dulivor-.  to  the  author  a  list  of 
n.imns  to  whom  tiis  paper  should  be  sent. 

I  he  author  is  responsible  for  sending  his 
p.iper  to  those  people. 

8.  Controlled  vs.  Free  Vocabularies 

A  significant  majority  of  SOI  pro¬ 
files  is  being  constructed  with  the  aid 
of  some  form  of  controlled  vocabulary. 

This  majority  (over  80<)  Is  In  turn 
divided  into  two-thirds  using  Its  own 
thesauri,  and  one-third  using  a  published 
version  by  such  organizations  as  DOC, 
National  Library  of  Medicine  or  the 
Engineers  Joint  Council. 

Of  the  computerized  systems  only  four 
use  free  language  input.  Document  input 
for  such  systems  Is  simple  and  allows  an 
Installation  to  input  documents  from 
several  sources  without  restructuring 
data.  User  profiles,  on  the  other  hand, 
become  long  and  complicated.  The  systems 
work  efficiently,  but  systems  using 
controlled  vocabularies  are  easier  to 
operate. 

In  little  more  than  ona-half  of  all 
systems  the  users  are  not  restricted  to 
any  limit  of  the  technical  terms  they  can 
use  in  thair  profile.  In  practically 
all  cases  profiles  are  edited  or  reworked 
by  the  systems  operators  Into  controlled 
vocabulary  terms. 

Only  one  system  practices  "auto- 
profiling",  where  the  profile  Is  con¬ 
structed  without  the  direct  choice  of  the 
profiled  people.  The  eutoprofl ling 
concept  entertained  by  various  people  In 
the  past  Is  apparently  no'''  finding  wide 
acceptance. 

9.  Individual  vs.  Group  Profiles 

All  sur/eyed  systems  managers  believe 
that  sni  serves  e  real  and  important  need 
of  screening  the  literature  for  special¬ 
ized  users,  and  they  Intend  to  continue 
some  form  of  current  awareness. 

There  are  some  signs  that  economic 


f.if'toro  .jttr.ic'  curt.iin  ‘‘lOI  opt'r-itors  to 
movo  mi^re  in  th«  diroction  of  qroup  pro- 
filos  .ind  .iw»<y  from  the  personul  i  zed  ser¬ 
vice  where  e.uh  collection  of  documents 
is  uniquely  m.itchod  with  the  Indlviduiil 
profile.  At  least  one  l.irqe  system 
(NASA)  is  having  second  thoughts  about 
tho  economic  feasibility  of  a  iarnc, 
fully  individual ized  SOI  service  and  has 
developed  a  now  program  called  SCAN  in 
which  computers  are  used  to  prepare 
specialized,  periodic  announcements 
based  on  group  profiles.  Similarly, 
our  own  office  stimulated  a  test  of  the 
group  profile  approach  which  resulted  in 
the  development  of  the  Clearinghouse 
notification  service  known  as  CAST 
(Current  Announcements  In  Science  and 
Technology).  There  Is  also  evidence 
That  many  individuals  receiving  person¬ 
alized  SOI  services  do  In  fact  repre¬ 
sent  group  interests.  By  establishing 
thoir  profiles  so  that  they  encompass 
the  interests  of  a  closely  related  group, 
they  act  as  the  front  men  for  the  ser¬ 
vice  for  a  number  of  Individuals.  This 
last  phenomenon  is  looked  upon  with 
favor  by  many  systems  managers  who  see 
iti  it  the  means  of  reducing  .Tiachlne 
processing  and  mailing  costs,  and,  hence, 
user  costs. 

Despite  the  seeming  attractiveness 
of  group  profiles  only  25%  of  the  systems 
use  them;  the  remaining  15%  use  individual 
profiles  as  the  bese  for  their  services. 

It  must  be  remembered,  however,  that  most 
of  the  presently  operating  systems  have 
economic  incentive  for  any  departure 
from  the  Individual ixed  profiles.  It  is 
interesting  to  note,  however,  that  in 
■.•*varal  cases  where  the  user  was  given  a 
t'loice  of  more  expensive  individualized 
sorvice  and  a  considerably  cheaper  group 
profile  service,  the  switch  to  the  cheeper 
mothod  runs  as  high  as  10%. 

10.  Operatlnft  Costs 

As  might  be  expected  there  is  a 
tondency  for  yearly  costs  per  user  to 
decrease  as  the  number  of  users  increase, 
although  not  at  an  even  rate.  Of  tne 


eleven  systems  that  supplied  accurate 
annual  budgets,  the  average  cost  of  five 
systems  with  user  population  of  200  or 
less  was  $234  per  user.  The  average  cost 
of  systems  with  population  of  200-2000 
was  $115.  Although  this  tendency  Is 
worth  noting,  the  sample  In  our  survey  is 
too  small  to  place  too  great  a  reliance 
on  The  cited  costs. 

Since  SOI  services  are  usicily  pro¬ 
vided  as  an  adjunct  to  other  information 
services,  and  since  we  were  unable  to 
find  a  typical  system,  the  survey  did 
not  permit  an  estimate  of  the  typical 
costs  required  to  establish  a  new  SOI 
service.  The  analysis  of  budgets  (where 
available)  and  Its  correlation  with  other 
data  permitted  us,  however,  to  construct 
an  Index  which  could  prove  useful  to 
future  systems  planners.  This  Index  is 
derived  from  the  calculation  of  the 
average  cost  per  user  per  run,  which  In 
our  survey  is  calculated  to  vary  between 
$1  and  $2.  This  Index,  unfortunately, 
carries  some  hidden  relationships  that 
could  not  he  broken  out  for  analysis. 

The  number  of  profiles  and  documents, 
matching  techniques,  and  systems  mainte¬ 
nance  contribute  to  the  cost,  but  none 
appears  in  the  index.  Perhaps  the  best 
way  to  use  this  Index  Is  to  determine  the 
number  of  users  and  the  SOI  budget,  then 
to  devise  an  SOI  service  and  determine 
its  cost  per  run.  The  Index  can  be  used 
to  ceiculete  systems  costs  likely  to  be 
incurred  in  the  operating  year. 

1 1,  Profile  Matching  Techniques 

Matching  is  the  focal  point  in  SOI. 
At  this  point  the  user  profiles  and  docu¬ 
ments  are  matched  to  determine  what  docu¬ 
ments  are  relevant  to  which  users.  There 
are  four  major  matching  techniques:  in¬ 
dividual  or  aggregatad  profiles  matched 
against  either  individual  or  aggregated 
document  profiles.  (Aggregated  means  a 
master  list  of  index  terms,  usually  with 
appropriate  tegs  to  get  back  to  the 
proper  user  or  document.) 

Three  major  matching  strategies  are 
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|i(  ,ii  <  lifH'.ir  ,  l)(x>l<-<iri  kMjic, 
mil  w)*i<|ltlO(l  lorm*..  Iho  piirpo'io  of 
ni.itiliini)  is  to  find  "lii  f’’.",  f.e.  events 
wtH'ic  t  )liH;unK‘ii1  Icim  s.il  isf  lo*.  Iho  user’s 
pr  'lilc*  lorm.  liiWMi  iMlchini)  mo<ins  that 
i(  .my  lorm  in  1ho  usor  profile  nwitches 
■my  form  In  Iho  dooumnni  record,  then  a 
hi  I  i'.  (loner.ilod.  (looloan  strateqy  uses 
•. I.i tomonl s  of  form*.,  ronnocfod  hy  and,  or, 
mil  iiol.  A  hit  ro(]uiro‘.  In.iT  the  entire 
-.1  iionH’iil  bo  satisfied.  Weighting  in- 
di  .ilos  that  the  terms  of  the  user’s 
pt'iilo  are  weighted  to  Indicate  relative 
i nipt >r lance.  Matching  operates  In  the 
•..inK!  manner  us  linear  nwtching  except 
th.it  a  running  score  of  weights  Is  kept. 
The  number  of  hits  required  to  generate 
.1  notice  is  arbitrarily  determined  by 
e.u'h  system. 

Approximately  of  the  systems 
iii  ili  hoil  individual  prof  i  los  aqai nst 
individual  documents.  This  approach  Is 
advi  .ahle  for  any  m.iti;hing  strategy  other 
Iti.m  linear  m.itchinii,  tind  most  of  the 
•.y  li'ms  with  boolean  logic  or  weights  use 
i!.  Line.ir  m.itchlni|  can  bo  performed 
ilirmsf  in  any  nvinner ,  hut  individual 
iu  1)1 1  If.  .igalnst  Irulividual  documents  is 
pr’id iTred  by  tho  surveyed  systems.  To 
u-.i'  .igijrfHj.itt'd  list*,  simplifies  the 
m.ii<t)it)q  but  adds  additional  sorting 
run*,  to  find  users  and  remove  duplicates. 

I?.  User  Notices 

Iho  typical  usor  notice  contains  an 
ah*.  It  at  t  and  is  machine-printod.  Half  of 
till-  .ys terns  usn  pun*  h  cards  or  double 
iiin  (.•d  i>unch  Cards.  'Hxnr  of  tho  remaining 
•,y-  use  cards  •'>1  other  Ih.m  punch  card 
'  raids  aio  praiimbiy  pfufcr.iblo  frr)m 
ft..-  u-.cr  standpoint  Ihumuso  they  are  more 
ilui  itih*  and  easier  to  file. 

In  tp'noi .il, user prefer  an  ^tbstract 
I  1  imtiio  IMS  Must*  itio  user  Mould  rathor 
p  n.'  fhi'  final  ilocisitm  aliout  the  docu- 
i:.<  I,  •  .  t.'lovanco.  In  .idditlon  since  the 
ill’,  •l'l•'nl  '•houid  (to  til  at  least  sono 
in f ,  I'oadiih)  tin?  abstract  wny  often 
-  n;  ,  iy  .‘iiongh  inform.irinn  without  taking 
t'-i,  ii  .i!  III.'  's  Vherofore,  docu¬ 


ment  delivery  as  the  form  of  the  notice 
does  not  seem  to  be  desirable  from  the 
user’s  viewpoint.  Systems  managers  are 
probably  Just  as  happy  because  documeni 
reproduction  costs  can  mount  up  quickly 
for  a  system  with  more  than  a  smal I  number 
of  users. 

Three  systems  deliver  documents 
as  the  user  notice.  These  systems  are 
designed  in  such  a  way  that  document 
delivery  seems  to  be  the  best  approach. 

One  system  uses  SOI  to  route  selectively 
technical  in-house  papers.  Another 
system  uses  task  profiles  to  screen  In¬ 
coming  documents  and  routes  25%  directly 
to  the  tasks.  The  third  system  uses  an 
SOI  approach  to  route  engineering  draw¬ 
ings. 

13.  Frequency  of  Notification 

Most  fully  operational  systems 
distribute  documents  either  weekly  or 
every  two  weeks.  Weekly  notices  are 
predominant.  A  few  send  daily  notices* 
and  a  few  send  monthly  notices. 

Frequency  seems  to  depend  on  system 
design  and  the  user's  expressed  needs 
rather  than  document  volume  or  SOI  budget. 
SOI  system  design  no  doubt  reflects  the 
time  made  available  for  the  service  as 
well  as  the  Interpretation  of  the  users’ 
information  needs. 

Experimsntel  and  partially  oparation- 
al  systama  oparata  in  a  variat/  of  ways. 
Soma  sand  vary  fraquant  nottcas  as  dlf* 
farant  paramatars  ara  triad  on  tha  sama 
data  basa.  Soma  syatams  distrlbuta 
notices  much  less  fraquant iy  than  they 
intend  to  whan  tha  system  reaches  oper¬ 
ational  statue. 

I’**  ProfUa  Modification 

All  SOI  syatams  have  provisions  for 
profile  modification.  All  surveyed  sya- 
tems  not  only  modify  prof  I  lea  on  uaar 
request,  but  also  have  soma  routine 
method  of  modification  which  does  not  re¬ 
quire  direct  user  objection  to  non-re  I e- 
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Vinnt  announcements. 

The  majority  of  systems  (80/K)  relies 
on  users  to  indicate  profile  changes. 

Most  of  these  systems  ask  users  to  mark 
the  returnable  portion  of  the  announce¬ 
ment,  Approximately  2G%  of  the  systems 
Initiate  profile  modification  by  sched¬ 
uling  modification  at  fixed  time  intervals. 

Only  one  system  has  an  automatic 
modification  scheme.  Whereas  users  in 
other  systems  indicate  the  changes  to  be 
made,  users  of  this  system  indicate  how 
appropriate  the  announced  document  is. 

The  system  modifies  weights  of  terms 
relevant  to  the  notice  and  terms  that 
matched  but  did  not  generate  a  notice 
with  no  further  Interaction  with  the 
user. 

n.  SELLCTED  EXAMPLES 

In  this  section  we  present  brief 
(Inscriptions  of  several  operating  systems 
whicit  could  be  considered  as  useful  pro¬ 
totypes  for  future  SDI  systems. 

I ,  Computerized  Focused  Vision  SDI 

The  system  at  the  U.S,  Bureau  of 
Reclamation  has  about  2000  j uni on  and 
senior  civil  and  mechanical  engineers 
and  physical  scientists.  Each  month 
about  1250  documents  are  reviewed  by 
scientists  In  the  field,  and  about  SO 
(41t)  are  inputted  into  the  SDI  system. 
Inputting  averages  6  1/2  hours  per  docu¬ 
ment,  Document  Indexing  averages  20 
terms. 

This  system  inputs  only  At  of  the 
documents  reviewed  so  that  only  partic¬ 
ularly  cogent  documents  wl  I  i  be  I'^putted. 
Men  who  are  professionals  in  their  field 
read  and  select  appropriate  documents, 
and  these  documents  are  passed  on  to 
superiors  who  also  eliminate  documents. 
Hence,  the  documents  finally  inputted 
are  highly  specialized  and  "focused." 

This  closely  OKitrolied  type  of  input 
is  apt  to  result  In  a  high  input  time. 


User  profiles  are  constructed  using 
the  Bureau  of  Reclamation  Thesaurus.  The 
profiies  are  limited  to  20  terms  and  no 
more  than  4  of  these  terms  may  be  coded 
with  an  *  to  indicate  special  Importance. 
The  user  constructs  his  profile,  but  it  is 
reviewed  by  the  staff  and  altered  If  nec¬ 
essary  to  reflect  what  they  believe  his 
needs  to  be. 

Matching  is  linear.  If  3  terms  or 
I  special  term  match  the  document  terms, 
a  notice  is  generated.  SDI  runs  are  made 
monthly.  The  user  receives  an  abstract 
or.  a  punch  card  and  a  form  for  requesting 
the  full  document.  The  system  provides 
1005  document  and  microfiche  back-up. 

Profile  modification  is  primarily 
handled  by  staff  who  analyze  the  results 
of  each  output.  User  complaints  are 
treated,  but  they  are  usually  rare. 

2,  Manual  Peripheral  Vision  SDI 

The  Scientific  Documentation  Centre 
based  in  Dunfermline,  Fife,  England,  is 
a  commercial  SDI  system.  The  system  has 
a  particularly  large  document  volume, 
300,000  Items  yearly,  selected  from  a 
wide  range  of  sources  and  countries 
covering  a  wide  range  of  topics.  Docu¬ 
ments  include  all  of  U.S.  Government 
Research  and  Development  Reports  and 
appropriate  parts  of  U.S.  and  British 
Thesis  Titles.  The  system  started  four 
years  ago  solely  for  SDI.  Gradually 
the  topics  are  being  made  available  for 
retrospective  searches  also  so  that  about 
half  are  included  In  a  storage  bank  at 
present.  Science,  technology,  medicine, 
botany,  zoology,  and  engineering  are  some 
of  the  fields  Included.  Fees  vary  accord¬ 
ing  to  topics  selected.  Most  topics  cost 
between  fifteen  and  fifty  pounds  per  year. 
Many  of  these  prices  are  less  than  Amer¬ 
ican  systems. 

A  potential  user  reviews  the  list 
of  topics  available  and  Indicates  his 
choices.  The  staff  at  the  Centre  con¬ 
structs  the  profile.  The  user  receives 
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wcl'kiy  iiolifc-.  on  .1  cirird  ot  .1  punch 
<  ,11  il  Krordiiiii  lo  liis  proforonco.  1  .i«:h 
lonl.iiti'i  d  cihilion  und  keywords,  ihe 
u'.cr  ttuiy  order  tho  document,  but  ordering 
docuiTK;ntr.  requires  on  additionai  fee. 

iipocific  facts  .ibout  users  and  pro¬ 
file  construction  are  considered  confi- 
df'fili.il,  as  aro  the  workings  of  the 
syJom.  This  system  is  operated  without 
horu'lit  of  computers.  A  staff  of  forty, 
mostly  students  who  work  part-time,  main¬ 
tains  the  SOI  service,  establishing  that 
manuni  systems  are  not  limited  in  size 
or  capabi I i ty. 

(iroup  I'rof  I  les 

NASA/SCAN  is  an  experimental  system 

lh. il  permits  selective  dissemination  to 
l.iific  numbers  of  users  in  an  efficient 
.>nd  inexpensive  manner  by  modifying  some 
of  llu^  standard  SDI  techniques.  Instead 
ol  i^reating  individual  profiles  to  meet 
1hc  information  needs  of  individual  users, 
SCAN  offers  a  list  ot  |89  topics,  the 

pro! lies  for  which  are  already  constructed. 
A  p<vtential  user  selects  his  areas  of 
interest  from  the  list  and  informs  NASA 
of  iii'i  decision.  This  type  of  approach 
«,eem-.  to  satisfy  both  the  users  and  the 
systems  staff.  Profile  construction  is 
f|ui  le  often  very  tiriKi-consumlng  and 
expensive  because,  although  the  user 
know,  what  ho  wants  in  receive,  the  user 
■  liiiuil  alway^  reflect  his  needs  through 
iiid.  ^  terms  tie  chooses.  His  profile 

li. 'iin'.  to  reflect  Ills  needs  aftor  several 
Ml  I  runs  and  c.onferencos. 

It»;  user  solocis  one  or  several  of 
tii<  .CAN  topics,  and  his  name  Is  added 
to  Hie  list  for  each  of  the  topic  pro¬ 
files.  Ho  spends  a  minimal  amount  of 
efiei  t  and  time  to  participate  In  SCAN. 

Ai  piesont  500  users  interests  are  re¬ 
tie*  tod  by  180  profiles.  Profile  modi- 
fli.ilion  merely  involves  adding  or  de¬ 
leting  topics  and  can  be  done  at  any 

I  i  MX  •  . 

the  data  base  is  Made  trom  articles 
in  I  AH  Clcicntlflr  .ind  Technical  Aero¬ 


space  Abstracts)  and  contain  about  5500 
documents  per  month.  Those  ar'*icles  re¬ 
quire  1.5  hours  of  Input  time  per  docu¬ 
ment  which  Includes  abstracting. 

Document  records  and  Interest  pro¬ 
files  are  matched  twice  a  menth.  The 
interest  profiles  use  weighted  terms  to 
simulate  boolean  logic.  A  notice  is 
generated  when  the  document  terms  satis¬ 
fy  one  logic  statement  In  the  Interest 
profile.  A  user  notice  contains  tho 
document  citation  and  Index  terms  with 

a  returnable  document  request  form. 

• 

4.  Library  Acquisitions  SDI 

Fort  Bel  voir  (U.S.  Army  Mobility 
Equipment  Research  and  Development  Center) 
has  an  approach  to  SDI  unique  to  this 
survey.  This  system  pays  for  itself  by 
screening  potential  documents  to  include 
only  relevant  documents  in  the  Technical 
information  Division  library.  The  money 
saved  by  not  purchasing  irrelevant  docu¬ 
ments  more  than  pays  for  the  system. 

There  are  approximately  225  research 
tasks  at  Fort  Be  I  voir.  Interest  profiles 
are  created  to  reflect  the  work  of  each 
task.  The  profiles  are  constructed  using 
an  internal  vocabulary  based  on  the  DDC 
thesaurus.  The  profiles  are  cumulated 
into  e  master  profile  for  the  division, 
and  all  documents  are  matched  against  the 
prof i le. 

Documents  that  are  accepted  into  the 
library  are  re-evaluated  by  a  staff  of 
technical  Information  special  I sts.  These 
people  decide  whether  to  route  the  docu¬ 
ment  to  the  appropriate  tasks,  announce 
it  in  a  bulletin  of  recent  acquisitions, 
or  both.  Documents  (not  notices)  ere 
routed  directly  to  the  tasks  If  they 
adequately  meet  the  task  requtremants. 
Documents  of  a  more  general  but  related 
interest  are  announced  in  the  bulletin. 

5.  Free  Language  Computerized 

SYgrtw  ^ - 

The  SDI  system  at  Ames  Laboratory, 
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li:.AI(\  low.)  t.)to  III) i Vfirsi ty ,  ran  tie  con- 
si  dorod  a  functional  experiment  for  .1 
regional  and/or  national  non-special i^od 
( urrent  awareness  service.  Few  limits 
.ire  put  on  the  user  or  on  document  input; 
cost  Is  low  (between  H  and  I0<  per  noti- 
fioation),  and  participation  Is  good,  in 
1965  participation  jumped  from  1/3  to 
^/4  of  eligible  users  (senior  scientists 
only)  within  six  months  after  the  sci¬ 
entists  saw  the  benefits  to  be  gained, 
ffccently  Ames  SOI  coverage  has  expanded 
to  include  industrial  departments  and 
corporations  (included  by  the  State 
Technical  Services  Act)  and  university 
departments,  as  well  as  individual  sci¬ 
entists.  Currently  there  are  220  inter¬ 
est  prof  1 les. 

Profile  vocabulary  Is  not  controlled. 
Users  may  list  authors,  journals,  foreign 
language  terms,  synonyms.  They  may  use 
word  clusters  of  up  to  six  words.  They 
may  uso  truncating  and  extension.  They 
m.iy  use  negative  words  and  word  clusters 
used  for  total  negation.  Terms  are 
weighted  between  0  and  I  to  four  sig- 
r)if leant  digits,  initially  by  the  user 
and  subseauentiy  modified  by  the  system. 
Long  profiles  are  encouraged,  and  the 
average  Is  130  terms  (words  or  clusters). 

Document  input,  6500  -  7500  documents 
per  week,  comes  from  many  sources;  there¬ 
fore  input  Is  not  uniform,  but  ranges 
from  title  and  author  only  to  abstract  to 
text.  There  is  no  practical  system  limit 
to  the  length  of  a  document.  The  system 
has  been  designed  to  adapt  to  assorted 
input  by  varying  the  hit  level  document 
by  document  (different  from  any  other 
system  so  far)  according  to  the  number 
of  terms  available  for  matching.  If  only 
titles  are  given,  the  level  Is  0.3000;  for 
abstracts  the  level  Is  0.5000;  and  when 
texts  are  used,  the  hit  level  is  0.7500. 

After  the  user  returns  his  Port-a- 
Pun^  card  indicating  his  interest  in  the 
cited  document,  the  weights  of  his  pro¬ 
file  terms  are  increased  or  decreased  or 
are  left  unchanged  automatically  by  a 
computer  program  used  for  profile  modi¬ 


fication.  For  a  positive  response,  the 
weights  of  the  matched  user  profile  terms 
are  increased;  for  negative  responses, 
the  weight  of  the  matched  terms  are  de¬ 
creased.  When  profile  terms  match  docu¬ 
ment  terms  but  do  not  generate  a  hit,  the 
weights  on  these  terms  are  increased  but 
at  a  slower  rate.  This  method  of  modi¬ 
fying  weights  is  so  sophisticated  that 
those  users  who  wish  to  adjJst  their  own 
weights  soon  give  up  and  let  the  system 
do  it.  The  breadth  of  document  coverage 
is  too  much  for  one  man  to  try  to  analyze 
and  control  the  weighting  strategy  of  his 
prof i le. 

Ames'  SDI  system  takes  3  1/2  hours 
per  week  running  time  on  an  IBM  360/50 
and  IBM  1401. 
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PART  II  -  STUDY  DATA 


A.  1)1  I  AILED  ANALYSIS  OF  DATA 

The  SDI  questionnaire  found  in  Appen- 
(li^  A,  h.is  R  sections  that  pertain  to  8 
p.ii  .iintjlers  to  be  ancily^ed  by  this  survey: 
i)f'iu!ral  factors,  users,  documents,  pro- 
fili!'.,  prof  i  le  mod i  f  ication,  matching, 
u'.cr  notice, and  equipment  and  personnel. 
llii‘.  portion  of  the  report  follows  the 
for  III.  1 1  of  the  questionnaire. 

I .  nenoral  Considerations 

this  section  covers  background  infor¬ 
mation  for  each  SOI  installation. 

Questions  I  and  .4,  name  of  organ  I - 
/.it  ion  and  furm<il  title  of  SOI  system, 
wfi"  only  tor  records  and  will  not  be 
i|i  .1  ii--,r.n<l.  The  remaining  questions  will 
III'  presented  .is  follows: 

Question  7.  E.ich  organization’s 
dff Ini  lion  of  the  SDI  concept. 

Qiioslioii  4.  lonqtli  of  time  !'<0I 
systoni'.  h.wo  been  in  oporatlon. 

Qui'-.lirvi  Mil  budget. 

i.»u..',tion  (>,  '.111  .Vi  purl  of  library 

i.»i  l.irgdr  information  center. 

Qu«*slion  /.  Operational  level. 


Question  7.  I  .ich  organization’s 
.i.'i  Ini  I  ii'n  of  the  jOI  concept. 

It  w.is  expected  that  major  differ- 
.  r,  .  v.  hoiwi^n  systoms  could  bo  traced 
'  i.  »•  to  f.ii  ii  -trgani /ritlon’s  Intnrpret.afion 
<>!  111.'  .il  ..'.i.  fpi;  lurnm  inc  aef  i  ni  t  ions 


supp I i ed  were  to  be  used  as  the  framework 
for  analyzing  each  system.  However,  In 
most  cases  formal  generalized  definitions 
rather  than  individual  working  definitions 
were  supplied,  and  no  real  insight  Into 
the  uniqueness  of  each  system  was  gained. 
All  definitions  received  did  fall  within 
the  limits  of  the  survey  criteria.  The 
individuality  and  purpose  of  each  system 
were  more  apparent  from  answers  to  the 
rest  of  the  questionnaire  and  through 
discussion  with  participants. 

Question  4.  Length  of  time  SOI 
systems  have  been  in  operation. 

Two  formal  systems  Included  in  the 
survey,  Crerar  Library  (System  4)  and 
M&T  Chemicals  (System  38)  have  been  oper¬ 
ational  prior  to  the  name  SDI.  Since 
1958  when  Luhn  presented  the  SDI  concept, 
the  number  of  systems  has  Increased 
steadily  each  year.  Figure  I  shows  the 
cumulative  growth  of  SDI  since  1958  as 
represented  by  thirty-seven  of  the  sur¬ 
veyed  SDI  systems.  The  graph  shows  both 
the  number  of  systems  operating  each  year 
and  the  number  of  new  systems  added  each 
year.  One  system,  B.F.  Goodrich  (System 
37),  is  excluded  from  the  graph.  The 
system  has  been  fully  debugged  and  Is 
capable  of  being  fully  operational  but 
at  present  Is  non-operatlonal  owing  to 
lack  of  available  personnel.  U.S.  Army 
Natick  Laboratories  SDI  system  (System  9) 
is  non-operations I  at  present  but  Is 
nevertheless  included  in  the  growth  curve. 
Natick  Labs  has  just  completed  a  nine- 
month  pilot  test  started  in  the  fall  of 
1966,  and  the  results  are  being  analyzed 
fit  present  in  order  to  Implement  a  fully 
operational  system. 

At  present  we  believe  the  number  of 
SOI  systemt  Is  smell  compared  to  the 
.lumber  of  potential  systems.  The  service 


Partially  operational  «:ystemB  started 
Fully  operational  systems  started 
Experimental  systems  started 

Cumulative  Growth  Curve 
Number  of  systems  in  operation 


prior 


FIGURE  1.  Growth  Pattern  of  Surveyed  Systems 
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provided  hy  M)l  soeiir,  It)  serve  .1  Vt?ry  im- 
porl.int  iioed.  We  Irolieve  Ihtil  Sf)!  syslenis, 
or  .it  le.ist  the  service  provided  by  SDI, 
will  continue  to  be  dovolopod.  Our  opin¬ 
ion  is  supported  by  survey  participiants, 
some  of  whom  expressed  dissatisfaction 
with  and  expected  to  change  operations, 
but  not  cancel  the  system. 

Question  5.  f'.iU  budget. 

Yearly  budget  figures  were  received 
from  seventeen  of  the  thirty- 

I'ight  systems,  with  government  systems 
siipplyitK]  the  m.ijnrity.  SDI  installations 
wet  i'  divided  inio  three  categories:  gov- 
ornimint,  priv.ite  institution,  and  ur.ivor- 
sity  SDI  systems.  (See  Appendix  B). 

Budget  figures  were  received  frorr. 
nine  of  the  fifteen  government  systems, 
seven  of  the  twenty-two  private  insrl- 
tulions  and  ttie  one  university  system, 
the  budrjots  r.inrje  from  "free  plus  two 
hours  of  clerical  time  per  month"  to 
1. 1  MO, 000  per  year.  The  "free"  SD! 
sy.lem  IxBongs  to  ttie  U.S.  Naval  Weapons 
laboratory,  lor  I  heir  willingness  to 
lake  part  in  the  NASA/SCAN  experiment 
anil  supply  NAMA  with  statistics  and  other 
(I't’dback,  NASA  is  absorbing  the  cost. 

Ihc  only  cost  incurred  by  the  Weapons  Lab 
I  i In  ary  is  for  one  hour  of  clerical  time 
fill  each  SCAN  run,  made  twice  a  month, 
li.  m.ii  I  nofices  to  users. 

1 1!  somi'  •  .iser.  wo  wore  told  that 
hmi  ji'l  informal  Ion  w.is  private  and  could 
iiol  ho  rolnasr'd.  Iri  olhor  cases  SDI  was 
an  inlogr.il  part  of  a  l.irgcr  system,  and 

111..  Kisi  could  n<i1  be  broken  out  of  the 
till  a  I  Iniilgof.  Ivr-n  for  the  budget  figures 
10.  i-ivoii,  compatibility  boNtoon  the  fig- 
ur- w.i*.  dllficuM  to  judge.  An  SOI 
-y.lom  Is  often  pl.innod  around  existing 
o.ji.ipmotit ,  pr iMjr.am*.,  personnel,  and  time. 

Ill  11  o  I  i. lures  may  include  salaries 

ill  sohm'  lasos,  parti.il  salaries  in  others, 
.1...1  no  s.ilaries  in  others,  depending  on 
h.iw  o.H.h  installation  interprets  SOI 

1.1.1.  A  itet. riled  «ost  .inaiysis  of  aeita 
riMi-ivod  is  presented  in  Burt  VII. 


Question  6.  Part  of  a  larger  system. 

SOI  systems  are  subsystems  of  a  li¬ 
brary  or  data  storage  bank.  Two  except¬ 
ions  were  found  in  this  survey:  National 
Cancer  Institute  and  Vitro  Laboratories. 
The  former  developed  SDI  as  a  primary 
system;  the  latter  operates  a  manual  SDI 
system  primarily  to  distribute  engineering 
drawings  outside  of  its  computerized  in¬ 
formation  center. 

Question  7.  Operational  level  of  SDI 
systems . 

B.F.  Goodrich's  inactive  but  fully 
operational  system  is  excluded  from  any 
of  the  three  choices  this  question  offers. 
The  remaining  systems  are  divided  as  fol¬ 
lows: 

Fully  operational  65K  (24  systems) 

Partially  operational  8f  (3  systems) 

Experimental  27%  (10  systems) 

TOTAL  I oof  37  systems 

Comparing  lengths  of  time  experi¬ 
mental  and  part I  a ly  operational  systems 
have  been  functioning  seems  to  Indicate 
that  SDI  systems  need  between  two  and 
three  years  to  became  fully  operational. 
Since  SDI  is  a  new  concept  and  the  number 
of  fully  operational  systems  Is  relatively 
small,  there  is  little  precedent  on  which 
new  systems  may  rely.  Hence  what  seems 
to  be  a  long  period  of  trial  and  partial 
operation  becomes  a  necessity,  as  most 
systems  experiment  with  several  SDI 
components  before  developing  fully  oper¬ 
ational  systems.  One  of  the  more  inter¬ 
esting  and  significant  findings  In  this 
survey  is  the  variety  of  approaches  to 
SOI  that  have  been  developed. 

2.  User 

The  following  topics  are  covered: 

I.  Population  limit  of  the  system 


iind  the  present  number  of  users 

2.  Population  composition  -  Mana- 
qorlal  and  Other 

3.  Number  of  eligible  users 

4.  Profiles  reflect  interests  of 

a.  Individuals  and/or 

b.  Groups  related  by 

1.  Discipline 

2.  Task 

3.  Administration 


Question  I.  Population  limit  of  the  sys¬ 
tem  and  the  present  number  of  users. 

i’opulafion  limits 

It  is  assumed  that  each  SOI  system 
has  some  finite  user  limit  beyond  which 
the  system  becomes  impractical  to  operate 
In  the  present  manner  on  existing  equip* 
tmint.  This  question  is  designed  to  evai* 
u.ite  system  utilization  to  aid  in  the 
determination  of  the  acceptance  of  SOI. 

I  he  thirty-eight  participating  systems 
answered  the  population  limit  portion  of 
the  question  as  follows: 

Thirty-two  systems  were  open-ended. 

Six  systems  had  a  fixed  population 
limit. 

The  thirty- two  systems  have  modified 
the  meaning  of  open-ended.  These  systems 
are  designed  so  that  the  present  nu^er 
of  users  and  the  anticipated  number  of 
users  do  not  put  any  Strain  on  the  sys¬ 
tem.  tkince  it  is  not  necessary  for 
these  systems  to  consider  a  maximum 
number  of  users.  Two  systems  that  did 
define  system  capacity,  Bonneville  Power 
Administration  (System  17)  and  Ames 
Laboratory  (System  20),  should  both  be 
considt'red  open-ended  within  the  context 


used  by  the  thirty-two  systems  above. 

Both  operate  significantly  below  capacity 
at  6.811  and  2.2|  respectively;  considering 
both  systems  operating  with  maximum  eli¬ 
gible  users,  the  percents  of  capacity  are 
30.0  and  5.0  respectively.  These  per¬ 
centage  figures  are  low  enough  to  justify 
including  these  systems  in  the  group  with 
"open-ended"  user  limits. 

Four  of  the  six  systems  that  defined 
population  limits  were  not  referring  to 
system  capacity.  These  four  experimental 
systems  have  a  fixed  user  limit  only  for 
the  duration  of  pilot  studies  in  order  to 
control  effectively  a  manageable  number 
of  users  for  test  purposes.  All  four 
systems  will  become  "open-ended"  when 
they  become  fully  operational. 

Our  analysis  Indicates  that  the  SDI 
systems  In  operation  expect  to  be  able 
to  serve  many  nore  users  than  they  anti¬ 
cipate  having.  Part  of  the  reason  may 
be  derived  from  the  fact  that  SDI  is 
usually  a  subsystem.  The  equipment  used 
by  an  installation  Is  large  enough  to 
accomodate  the  main  system  so  that  system 
capacity  Is  much  larger  than  SDI  requires. 

Population  Size 

SDI  systems  have  been  established  to 
serve  a  variety  of  population  sizes.  The 
broad  range  of  user  populations  Indicates 
that  SDI  itself  does  not  precipitate  a 
model  population  size;  rather  any  group 
of  potential  users  may  devise  an  SOI 
system  to  meet  its  needs.  In  order  to 
analyze  the  number  of  SDI  users  meaning¬ 
fully,  the  measurable  SDI  systems  were 
divided  into  three  groups. 

Experimental  and  Partially 

Operational  13  systems 

Group  !,  Fu I ly  Operational  16  systems 

Group  II,  Ful ly  Operational  7  systems 
TOTAL  36  systems 
No  response  2  systems 
58  svstems 
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fpr'tKirn I  I y  (jxpm  i inont.i I  sysloms  nper- 
ill’  wilti  .1  nutiilicr  of  users.  The 

ir.ct  '.  .ite  spfM  i  f  ly  soloctod  to  supply 
■.l.il  i'.l  i<  s  .iikI  other  (ocsdh.ick  tf)  tho  sys- 
l>■t^  (liv.i<|tint  lli.il  will  l)<’  unnecessary 
oiiic  Ihe  '.ystein  is  fully  imp  I  omen  led. 
leu  of  tho  Ihirtoon  systems  fiavc  sm.j|l 
ii'.i't  populatifuis  with  a  moan  average  of 
users.  The  other  three  systems  have 
populations  of  hOO  (NASA/SCAN  experi- 
tiieiil),  500  (N.iv.il  W('.)pons  Lahorotory 
it!\iilved  in  NA''.A/r.CAM )  and  flOO  (floll 

i. ‘l<’ption<;  Laboratories'  partially  oper- 
•  ilioti.tl  system). 

Ihe  tully  oper.j| ‘or.al  syctems  were 
r.iuked  in  oriler  of  user  si/c  t!xc lading 
Iwo  sy. terns  that  were  unable  to  answer. 
Ihoie  w.is  a  n.itural  break  between  400 
u'.f'rs  .jn<l  700  users.  Oroup  I  Includes 
/D'/i  of  ttio  fully  operational  systems  and 
h.v'.  a  r.mijo  from  1^  to  400  users.  Oroup 
II  urntaius  .ev(5n  l.ir'ije  syslonis  witn 
jip'IHil.ttions  wiaely  srattenHl  from  700 
to  /HOO  user*.  ,ind  are  considered,  there¬ 
fore,  .itypic.ii,  but  novortholoss  slgnlfl- 
e  Hit . 

ilH'  lac-gesf  syslem  in  our  survey, 
ll'M's  torporaft'-wid(>  system  located  at 
Aimonk,  New  York,  <u>i  ves  .is  a  g<xKJ  net- 
woi  k  liir  distt  ihul  in-i  lltM  Information 
b.  ill  interesled  personnel.  It  is  likely 
isil  liieii*  .lie  iiol  many  lnst.il  lations 
i!i.i  ne«sl  .in  .ilertii'i  syslitm  this  big. 

ii. .w  'vei^  itii.  liiM  system  exists  and  there- 

i  .  . t.ihl  istii's  the  tc.'siblllty  of  such  a 

sys 'em. 

tifii  other  system  has  boon  designed 
f"  ■■.i.ihlish  tho  foasihilily  of  large 
. .  Mfr.il  i/etj  dissomln.it  Ion  rm+firs.  In 
tii‘  .  e.r...  unr«'strlctr*d  In  subject  coverage 
(  •(•  •.  lahi.ratory  wl  ii  syst-'m  capacity  of 
i  <,.H)n  us..fs),  11  nbiy  i»e  that  effective 
.  .it.ll  i,s*d  dlsstimiti  itlon  nystoms  will 
!  .  .•  it.ii>i  islu'il  In  trii:  future,  and  with 
liis..  ..h.it  ini]  !»  ommlrf)  more  common 
(|.•t...lps  ri’jii.ii  itii]  individual  SOI  Install- 
ins.  i’.ul  t*.r  the  prr'.ont  time  at 
|.  1  .1  our  suivi'V  indii.ites  tiMt  most  SOI 
•y  SI  rve  .l-H)  or  fi*wiir  usors. 


Cost  and  Users 

There  is  a  slight  tendency  revealed 
by  our  survey  that  as  the  number  of  users 
In  each  SOI  sys+em  increases,  the  cost 
decreases.  Fourteen  systems  could  be 
tested,  and  we  found  that  systems  with 
200  users  or  less  have  an  average  yearly 
cost  of  1234,  and  systems  with  220  to 
2000  users  have  an  average  cost  of  $115. 
We  believe  that  this  finding  Is  somewhat 
inconclusive  because  the  user  Is  not  a 
significant  factor  in  SOI  systems  when 
considered  separately. 

Question  2.  Population  Composition 

The  purpose  of  this  question  mss  to 
ascertain  what  portion  of  SOI  users  were 
managers.  The  original  choice  offered 
to  participants.  Managerial  or  Bench 
Scientist,  was  too  restr I ct I ve .  bur 
survey  ^ound  that  SOI  Is  not  limited  to 
scientific  personnel  In  research  lab¬ 
oratories  but  also  Includes  doctors  of 
medicine,  engineers,  and  salesmen,  as 
well  as  personnel  In  other  fields  where 
Bench  Scientist  does  not  accurately  de¬ 
scribe  the  user.  As  pert! cl  pants  were 
interviewed,  this  question  was  presented 
as  Managerial  or  Other,  and  the  change 
was  made  on  subsequently  prepared  ques¬ 
tionnaires.  in  addition  participants 
wore  told  that  the  term  Managerial  was 
not  I imi ted  to  administrative  personne I , 
For  Instance,  it  is  assumed  that  resaarch 
projects  staffed  by  Junior  and  senior 
scientists  have  a  "Manager"  who  Is  him¬ 
self  8  scientist. 

A  total  of  62S  of  the  surveyed 
systems  serve  both  menagerie i  and  non- 
ranagerial  personnel.  Of  the  remaining 
seven  systems  two  systems  (System  3 
and  4)  distribute  documents  to  managers 
only,  and  five  (Systems  7,9,17,28  and  32) 
distribute  to  non-managerlal  personnel 
only.  Both  systems  distributing  to 
marager*«  only  use  group  profiles,  end  the 
managers  redistribute  notices  within 
their  respective  groups.  Of  the  five 
groups  that  distribute  to  non-manegerlel 
personnel,  three  will  distribute  to 
managers  as  welt  when  the  systems  become 


IS 
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fully  operd  t  i  on<j  I  (Sysloms  9,28  and  32). 

Evaluating  all  the  responses  both 
managerial  and  non-manager  I  a  I  personnel 
take  or  will  take  advantage  of  SOI 
services  In  thIrty-sIx  of  the  surveyed 
systems.  Evidently  there  Is  enough  of 
.)  common  Interest  between  these  two 
qroups  that  both  lake  advantage  of  SDI, 
or  else  the  scope  of  the  data  base  is 
broad  enough  to  meet  the  needs  of  ai I 
types  of  users. 

Question  3.  The  number  of  eligible 
uuers. 

It  Is  desirable  to  compare  the  num- 
ber  of  users  to  two  ways:  with  respect 
to  system  capacity  (Question  I)  and  with 
respect  to  el igib le  users.  Eligibility 
is  determined  by  each  instaliatlon  and 
is  considered  as  separate  from  system 
rapacity.  Criteria  include  education 
(one  system  for  instance^  requires  an 
MS  or  bettor),  field  of  Interest  (limited 
by  document  Input),  occupation  and  job 
level.  Some  installations  had  no  real 
requirements  except  that  the  users  be 
(irofessionals.  Seven  of  the  systems  are 
designed  In  such  a  way  that  the  number 
•<f  eligible  users  is  not  relevant.  They 
■tro  systems  that  work  for  clients  (e.g., 

(  rerar  and  Aerospace  Research  Applications 
Center)  or  that  are  open  to  very  large 
segments  of  the  population  (e.g.,  the 
'■>01  system  at  National  Cancer  Institute 
will  be  open  to  all  biomndlcal  scientists 
involved  In  cancer  research). 

The  question  was  applicable  to 
thirty-one  systems,  but  five  of  these 
systenm  were  unablo  to  supply  data.  The 
(lercent  of  eligible  users  that  use  SOI 
was  determined  for  the  twenty-six 
measurable  systems. 

4  systems  B0f>  -  lOOjC  of  eligible 
users 

22  systems  mean  of  I6f 

It  would  seem  that  SOI  should  be  a 
boon  to  persons  overloaded  with  new  lit¬ 


erature  and  the  need  to  keep  informed. 
However,  only  four  systems  include  be¬ 
tween  80K  and  I  OOjt  of  eligible  users . 

The  Bureau  of  Reclamation  (System  18) 
includes  80K  of  their  eligible  users 
and  Is  the  one  system  of  these  four  that 
has  only  individual  profiles.  The  Bureau 
is  also  the  largest  system  in  this  group 
with  2000  users  and  2000  user  profiles. 
Western  Electric  Company  (System  24) 
includes  83/^  of  their  personnel,  2000 
users.  Because  Western  Electric  uses 
group  profiles  however,  the  system  has 
only  300  Interest  profiles.  The  remaining 
two  systems.  Fort  Belvoir  (System  35)  and 
IBM  at  Poughkeepsie  and  Kingston  (System 
34),  operate  at  the  lOOK  level  and  use 
group  profiles. 

The  remaining  twenty-two  systems 
establish  that  SDI  Is  not  widely  accepted. 
We  removed  the  experimental  and  partially 
operational  systems  from  the  group,  and 
the  mean  percentage  of  eligible  users 
rose  from  I6!t  to  a  little  less  than  22K. 

As  stated  previously,  experimental  and 
partial ly  operational  systems  intention¬ 
ally  work  with  a  small  number  of  users. 

We  found  that  one  possible  reason 
for  low  participation  rate  Is  lack  of 
exposure.  When  the  SOI  system  at  Ames 
was  first  proposed,  little  Interest  was 
expressed  by  eligible  users  and  only  one- 
third  becMie  users.  After  the  scientists 
saw  results  from  SOI,  the  number  of  users 
rapidly  increased  to  75{  of  eligible 
users.  Those  Involved  with  the  SOI 
system  at  Fort  Monmouth  devaioped  a  pro¬ 
motional  campaign  to  generate  enthusiasm 
and  exposure.  Perhaps  Fort  Monmouth's 
approach  may  wall  be  advised  for  large 
organizations. 

We  also  found  soma  error  In  the 
figures  supplied  by  at  least  three  and 
perhaps  as  many  as  eight  systems  that 
belong  to  the  low  participation  group. 
These  systems  use  group  profiles  and  call 
"users”  those  people  to  whom  notices  are 
sent,  rather  than  all  of  the  people  in 
the  groups  who  receive  benefits  from  SDI. 
There  Is  some  reason  to  believe  that 
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i  liilitih'  usor  (igurc.  'jupplied  by  these 
‘.y, rtjflort  the  tiutnber  of  peop I e 
i  ilhet  111. in  Hie  iiumhot  of  groups  that 
.11  <•  f)|l(|il)|<'  for  r.Dl  .  Hence,  the  figures 
(.itinul  1)0  compared  br.-cduse  of  the  diffor- 
'•ni.*  in  units.  The  percentage  of  parti¬ 
cipation  would  definately  bo  raised  if 
these  figures  were  corrected,  but  we  can¬ 
not  delermiiH!  whettier  or  not  the  increase 
would  be  signif icar.i. 

Question  4.  Oroup  or  User  Profiles 

Sf)!  was  or i()i natty  designed  as  a 
■.oi  vice  to  individuals  so  that  each  pro- 

I  i  li'  was  to  retlect  the  information  needs 
I'l  an  individual,  tiowovor,  profile  con¬ 
st  i  act  ion  is  time  consuming  and  expensive 
enough  that  some  i nst<i I  lotions  use  pro- 
tiles  to  reflect  the  needs  of  a  group. 

M.iny  of  the  M)l  -.ystems  (about  15%) 
i;  .e  individual  profiles  either  exclu- 
ivi'ly  oi'  for  most  their  profiles, 
v.h.  II  Itieso  systems  u  <»  group  profiles, 
it  ;  .  by  usi'i  's  Choi  I  e, 

I’loup  profill's,  used  by  25%  of  the 

•  uivi'yi'd  ‘vys  terns,  piovide  selective  dis- 
•.eiiiiiiat  ion  niore  acorioinicat  ly.  For  in- 
,1. iiue  Westoin  I- lee  trie  (Tystom  ?A)  can 

Hie  needs  of  XOOO  people  with  only 
•>a  plot  lies,  fVofiling  time  and  com- 
,  iioi  linn*  arc  savrsl.  systems  using 

■  jioup  (iiot  ill's  bnso  profiles  around  task 

II  ''Ups. 

UltMip  profiles  are  easier  to  handle 
1 1  HI  file  sysliins  sfalt  viewpoint,  and 
i>  -le  i-.  '.01111’  I'vldonco  in  our  survey  that 
•l>.••l•.  m.iv  pii'ier  group  profiles.  One 

•  >Nji.'i>  iai  '.ysiriffl  iii.if  had  been  operating 
!  iy  with  individu'il  profiles  developed 

1  ;..i.>a.|e  similar  to  NAriA/SCAN  and  offered 
f.oi'i  b'  itu'ir  '  uslfspiers,  (See  IV.  Sciec- 
u  .!  l^ami'les  lor  diseusnlon  of  NASA/SCAN.) 
in.  '.y.fiiit  *o'. I  for  <iroup  profiling  was 
i.i  »..*d,  '.«*  the  Stivings  woro  passed  to 
if..'  ns.'r',.  ihoro  wa-.  .tpprnximalnly  70% 
in»jt— over-  from  inti ividu.il  to  group 
pi.iiib”..  lilt*  user',  tlid  not  mind  losing 
■i.liia'  iVai  i.itio  wi  III  individual  pro- 
i  i  o  e.  ■  iti-.f  if.oy  ‘.  ivt'd  monny  ind  pro¬ 


file  construction  time  and  then  infortned 
future  systems  planners  to  consider  group 
profiles  as  a  less  expensive  method  of 
meeting  user  needs. 

3.  Documents 

The  questions  found  in  this  section 
of  the  SOI  questionnaire  will  ba  discussed 
as  fol lows: 

(Juestions  1  and  10.  Systems  relying 
on  dissemination  centers  for  document  in¬ 
put.  Systems  contracting  for  Indexers. 

Question  2.  Average  document  input 

time, 

Questions  3  end  <1.  Number  of  docu¬ 
ments  included  as  SD I  input  per  month  and 
what  part  these  are  of  total  documents 
reviewed. 

Questions  5,  6  and  7,  Documents 
indexing  procedures:  vocabulary  and  limit 
and  average  number  of  Index  terms  per 
document. 

Questions  8  and  9.  Kind  of  documents 
used  for  SDI  document  Input. 

Question  II.  General  subject  areas 
of  Input. 


Questions  I  end  iO.  Systems  relying 
on  dissemination  centers  or  oontrseted 
Indexers  for  doeumsnt  Input. 

Selecting,  indexing  and  abstracting 
documents  are  the  biggest  on-going  tasks 
Involved  In  dissemination  of  current 
documents.  When  document  Input  can  be 
acquired  from  another  source,  the  system 
saves  time  and  personnel.  We  call  thasa 
sources  dissemination  centers  in  this 
report,  although  in  some  cases  dissemin¬ 
ation  Is  not  a  main  function  of  the  cen¬ 
ter. 

321  12  syetems  lOOf  dependent  on 
other  sources 
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V/-  1/  systcnr,  f’arti.iily  dopundetiT 

on  other  sources 

5f>%  14  systems  I’orform  all  docu¬ 

ment  input  in-houso 

The  12  systems  that  receive  100^  of 
their  document  Input  from  other  sources 
are  computer  systems.  They  use  one  of 
throe  approaches: 

Receiving  magnetic  tape 

of  document  records  (9  systems) 

Having  dissemination 

centers  perform  SDI  (2  systems) 

Using  only  author- 

i ndexsd  documents  ( I  system) 

The  12  systems  partially  dependent 
on  others  for  their  input  are  divided  as 
fol lows: 

Receive  some  input  from  dissemination 
centers  (6  systems) 

Use  contractors  for  all  Indexing  or 
whole  sot  process  (6  systems) 

A  little  less  than  two- thirds  of  the 
systems  surveyed  rely  on  other  sources 
lor  at  least  part  of  document  Input, 
Acquiring  ready  to  use  document  Input 
seems  to  be  an  e<tvlseble  approach  If  the 
documents  are  compatible  with  user  Infor¬ 
mation  requirements.  Receiving  document 
input  from  dissuwiinatlon  centers  often 
Himpitfies  the  task  of  designing  and 
implementing  a  functional  SDI  program 
hecouse  some  procedures  will  be  Indirectly 
control  led  by  the  center.  The  format  of 
document  input  may  have  some  effect  on 
user  profile  construction  and  matching 
techniques.  Software  packages  for  com¬ 
puterized  systems  and  consulting  are 
often  aval  table  from  dissemination  cen¬ 
ters. 

Question  2.  Average  document  input 

time. 

Input  time  refers  to  the  time  it 
takes  to  prepare  document  records  so  that 


they  are  ready  to  be  matched  against  user 
profiles.  As  we  indicated  in  the  proceed 
Ing  section,  some  systems  do  no  document 
input  processing.  The  following  summary 
of  responses  clearly  shows  the  time  saved 
by  using  dissemination  centers  or  con¬ 
tractors. 

1 1  systems  have  no  document  i nput 
time 

10  systems  take  between  2  and  90 
minutes  per  document* 

I  system  requires  6.5  hours  per 
document 

6  systems  made  no  response 

*NASA*s  two  systems  are  counted  as  I, 
since  both  systems  use  same  document  in¬ 
put. 

There  are  seven  non-computer i zed 
systems  In  this  survey.  Four  of  them 
have  measurable  document  input  time  and 
are  scattered  evenly  through  the  dis¬ 
tribution.  One  is  the  second  fastest, 
and  another  is  the  second  slowest.  Hence 
type  of  equipment  seems  to  have  little 
direct  relationship  to  speed  of  document 
Inputting, 

Wo  looked  at  two  factors  that  we 
enticipated  would  be  related  to  input 
time:  document  volume  and  depth  of  in¬ 
dexing.  We  found  that  no  relationship 
between  document  volume  end  Input  time 
exists  because  there  Is  no  measurable 
increment  between  time  and  volume  nor 
even  a  fairly  recurring  pattern  of  In¬ 
crease  in  time  versus  a  decrease  in 
voluRs.  We  had  only  slightly  bettor  luck 
with  depth  of  Indexing  versus  input  time. 
These  factors  are  related  In  5055  of  the 
systems,  but  no  Increment  could  be  found. 

We  concluded  that  document  Input 
techniques  are  highly  Indlvirfuui Ued. 

The  only  significant  group  is  the  group 
of  ten  systems  <50<)  that  do  no  docu¬ 
ment  input  processing. 

Omestlons  3  end  k.  Number  of  docu- 
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merit’s  processed  into  'jDI  each  month  and 
tho  por*1ion  ttioy  are  of  total  documents 
reviewed .  Percent  of  documents  inputted 
i n  to  SOI . 

Almost  all  systems  (28  out  of  a 
measurable  34)  process  1%  or  more  of  the 
documents  that  are  reviewed  for  SDI. 

Those  systems  receiving  total  document 
input  from  another  source  accept  100?, 
since  no  reviewing  or  purging  is  done 
by  any  oi  those  particular  systems  sur¬ 
veyed,  Orerar  Library  Inputs  only  10? 
of  reviewed  documents,  probably  because 
all  new  acq. .aitlons  come  to  the  Infor¬ 
mation  specialists’  attention.  The 
Hureau  of  Reclamation  accepts  only  5? 
of  .he  documents  reviewed  in  order  to 
input  only  the  most  significant  docu¬ 
ments  into  SOI.  The  indexers  at  fort 
lielvolr  select  about  25?  of  the  docu¬ 
ments  for  SDI,  and  the  rest  are  com¬ 
piled  for  a  bo<Jklot  of  recent  acquisitions 
that  includes  document  index  terms. 

Numoer  of  documents  processed. 

Tho  range  of  t.ie  number  of  documents 
pHKessed  Into  SDI  per  month  is  very  wide, 
from  25  documents  to  30,000.  The  systems 
were  ranked  in  order  and  divided  Into  two 
groups.  Group  I  contains  twenty  systems 
and  has  a  range  from  25  to  1200  documents 
per  month.  Group  II  has  fourteen  sys¬ 
tems  that  range  from  2200  to  30,000 
documents.  Four  systems  were  unable  to 
supply  data  for  this  question.  See 
Figure  2. 

We  tested  three  factors  tnat  we 
.issumed  would  affect  the  number  of  docu- 
mt»nts  a  system  processes:  the  number  of 
d<M;umonts  per  user  or  profile,  the  num- 
l>er  of  SOI  runs  per  month,  and  the  num- 
hnr  of  documents  per  user  or  profile 
p:?r  run.  These  factors  were  analyzed 
for  each  of  the  thirty-four  systems  to 
tUftermloe  whether  the  two  arbitrarily 
determined  groups  were  essentially  the 
-i.}met  that  Is,  perhaps  each  system 
prOcodses  approximately  the  same  nun^r 
of  documents  per  user  during  each  SOI 
run.  The  mean  averages  of  tested  factors 


appear  below: 


Group  i 

Group  1 

Doc/fflo 

473 

7592 

Profiles 

336 

554 

Doc/Prof  1 le 

II 

86.9 

Doc/Prof i 1 e/ 
Run 

7.1 

37.0 

After  reviewing  the  figures  above, 
we  excluded  the  two  largest  systems  from 
Group  1 1  because  they  were  much  larger 
than  the  other  systems,  but  only  Insig¬ 
nificant  changes  appeared  in  documents 
per  profile  and  documents  par  prof  He  for 
each  run. 

The  systems  I n  Group  1 1  not  on i y  have 
more  documents  and  more  profiles,  but  they 
proce.ss  more  documents  per  profile.  We 
looked  at  the  systems  that  form  these 
groups  to  explain  the  difference  between 
them.  Group  1  contains  rtost  of  the  ex¬ 
perimental  and  partially  operational  sys¬ 
tems.  These  systems  often  work  with  a 
sma 1 1  number  of  documents  I n  order  to 
establish  indexing  procedures  and  matching 
techniques.  Group  II  contains  commercial 
SDI  systems  (Crerar  and  Aerospace  Re¬ 
search  Applications  Ontar)  and  NASA. 

IBM's  corporate-wide  SDI  system  and  Ames 
Laboratory  art  also  in  this  group.  Even 
though  Ames  has  a  relatively  small  user 
population.  It  has  the  largest  document 
input,  partly  to  establish  the  feasibility 
of  a  centra  1 1  zed  dissemination  center  not 
limited  In  subject  coverage. 

We  conciuded  that  an  SDI  system  can 
be  established  to  serve  nearly  any  size 
group  with  nearly  any  volume  of  «k)eument 
input.  No  ratio  of  documents  to  users 
exists  for  the  surveyed  systems*  Non- 
cofflputartzed  systenm  were  evenly  divided 
between  Group  I  and  Group  It;  hence  type 
of  equipment  or  leek  of  equipment  has 
little  effect  on  size  of  the  system. 

Each  system,  whether  large  or  smell# 
seems  to  be  able  to  fulfill  its  funcflon# 
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FlGURK  2.  Number  of  DocumonUi  Proooaued  per  MooUi 


regardless  of  the  ratio  of  documents  to 
users. 

Questions  5,6  and  7, 

Indexing  procedures:  vocabulary  and 
maximum,  average,  and  minimum  number  of 
Index  terms  per  document. 

Participants  were  asked  whether  they 
use  free  language  or  a  control  led  vocab¬ 
ulary  to  index  documents. 

21  systems  use  controlled  vocabu¬ 
laries 

20  use  internally  developed 
vocabularies 

I  uses  external  vocabularies 
7  systems  use  free  language 

10  systems  subscribe  for  total 
document  input;  hence  the 
questions  are  not  applicable 

Use  of  Controlled  Vocabulary 

The  ten  systems  that  subscribe  to 
dissemination  centers  for  total  document 
input  have  no  control  over  document  In¬ 
putting  methods.  Hence,  these  systems 
are  excluded  from  analysis  of  indexing 
procedures.  However,  we  found  that  the 
centers  supplying  document  input  used 
controlled  vocabularies  In  seven  cases. 

The  other  three  are  participants  In 
Chemical  Abstracts  Service  SOI  experi¬ 
ment,  and  for  this  experiment  Chemical 
Abstracts  uses  free  language  abstracts 
as  document  Input. 

The  majority  of  SOI  systems  surveyed 
use  controlled  vocabularies.  Matching 
technique  and  profile  construction  are 
simplified  when  controlled  vocabularies 
are  used.  Twenty  of  the  systems  de¬ 
veloped  vocabularies  tailored  to  their 
neeas  either  by  modifying  or  supplementing 
pertinent  external  vocabularies  or  by 
constructing  completely  original  vocab¬ 
ularies.  The  only  system  relying  on 


external  vocabularies,  the  U.S,  Army 
Natick  Laboratories,  used  four  thesauri 
for  a  nine-month  pilot  test.  The  pilot 
test  has  been  completed,  and  procedures 
are  being  analyzed  in  order  to  Implement 
a  fully  operational  system.  During  this 
time  a  vocabulary  is  being  constructed 
using  the  four  thesauri  as  a  base. 

Hence,  a  1 1  surveyed  systems  that  use 
controlled  vocabularies  establish  internal 
vocabularies. 

Seven  systems  use  free  language. 

Four  of  these  use  computers  and  three 
are  manual.  Ames  Laboratory  SDI  system 
acquires  all  document  input  from  other 
sources.  Ames  uses  several  sources  but 
needs  only  one  program  to  Input  document 
information  regardless  of  vocabularies 
and  formats  used  by  Ames'  sources.  As 
a  result  document  input  Includes  ail  of 
the  following  tyras  of  format:  title 
and  author  only,  indexed  document 
citations,  abstracts  and  full  text.  One 
other  computerized  system  uses  free 
language  abstracts  as  document  Input 
(Bonneville  Power  Administration).  The 
corporate-wide  IBM  SC  system  inputs  free 
language  text.  One  other  computerized 
system  indexes  documents  using  free 
language. 

Three  systems  using  free  language 
are  manual  systems.  Documents  require 
no  Indexing  in  order  to  be  matched 
against  user  profiles.  Matching  docu¬ 
ments  and  users  requires  judgement  of 
Information  specialists  who  understand 
user  needs  but  are  not  limited  to  a 
fixed  vocabulary  or  to  a  formal  search 
strategy. 

Vocabulary  Size 

Most  vocabularies  are  between  1900 
and  9900  terms.  The  average  number  of 
index  terms  per  document  Is  usually  20 
or  less.  We  found  no  relationship  be¬ 
tween  the  size  of  vocabulary  and  the 
average  number  of  Index  terms  nor  be¬ 
tween  voliHne  of  doewnents  processed  and 
the  average  nuiMMr  of  index  terms. 
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Ill  *iumm.)ry  cckiItoI  led  vocabularies 
am  preferred  to  free  language.  Inter¬ 
nally  developed  vocabularies  are  or  will 
be  used  by  all  systems  using  controlled 
vocabularies.  Non-computer I  zed  systems 
are  divided  between  controlled  vocab¬ 
ulary  and  free  vocabulary  with  four  in 
the  former  and  three  In  the  latter. 

Most  controlled  vocabularies  contain 
between  1500  and  9500  terms.  Although 
most  of  the  systems  that  perform  their 
own  document  Inputting  functions  indicate 
1hat  there  is  no  upper  limit  to  the 
number  of  index  terms  per  document,  only 
three  systems  and  Chemical  Abstracts  use 
abstracts  or  text  as  Input.  Most  sys¬ 
tems  Index  up  to  twenty  words  per  docu¬ 
ment  . 

Questions  8  and  9.  Type  (internal 
or  externa  I /one-time  publications  or 
journal)  of  document  input. 

Participants  were  asked  to  Indicate 
what  portion  of  SDI  input  Is  one-time 
publications  and  what  portion  is  selected 
from  journals  or  periodicals.  These  two 
choices  combine  to  describe  total  docu¬ 
ment  Input.  Under  each  division  parti¬ 
cipants  indicated  how  many  one-time 
documents  and  Journals  are  Internally  or 
externally  generated  documents.  One  Is 
•  I  corporate-w I  da  system;  hence  documents 
written  by  anyone  In  IBM  or  found  in  IBM 
Journals  are  internal.  Another  system 
serves  only  the  Installation  of  Owego; 
hence  only  documents  written  In-house 
are  Internal.  The  third  system  is 
staffed  by  and  serves  two  IBM  offices 
at  PoughkMpsie  and  Kingston,  Hence 
documents  written  by  the  user  community, 
either  at  Poughkeepsie  or  Kingston,  are 
internal. 

The  ten  systems  receiving  ready-to- 
run  document  input  are  excluded  from 
supplying  Information  on  type  of  docu¬ 
ment  Input. 

4  systems  are  fairly  evenly  divided 

24  systems  rely  on  either  one-time 
publications  or 


journals  and  periodicals  for 
all  or  most  of  document  input. 

1 3  systems  use  one-time 
pub  1 1  cat ions 

4  use  external  documents 
7  use  external  documents 

i 

2  are  fairly  evenly  divided 

II  systems  use  journals  or 
periodicals 

II  use  external  journals  or 
periodicals 

Two  trends  become  apparent.  SOI 
systems  use  externally  generated  documents. 
Most  of  the  surveyed  systems  are  also 
seen  to  rely  on  a  single  document  type. 

The  Implication  here  is  that  systems  find 
one  or  just  a  few  sources  of  document  in¬ 
put  that  best  fit  users'  needs.  We 
think  It  Is  fair  to  assume  that  signifi¬ 
cant  Information  is  likely  to  be  pre¬ 
sented  in  all  four  forms,  although  In¬ 
ternally-generated  journals  are  probably 
rare.  However,  only  four  systems  pre¬ 
sent  a  mix  cf  one-time  publications  and 
journals.  By  narrowing  the  number  of 
sources,  systems  can  process  document 
Input  with  less  work. 

(Question  II.  General  subject  areas 
of  document  Input. 

A  chart  has  been  constructed  to  In¬ 
dicate  the  subject  areas  of  SDI  for  each 
system.  (See  Table  I.) 

4.  Prof  I  la 

The  SDI  subscriber's  interest  profile, 
I.e.,  the  aggregate  of  unit  terms  de¬ 
scribing  the  user's  areas  of  Interest, 
is  one  of  the  very  few  features  borne  in 
common  by  all  SOI  systems  being,  as  It 
were,  one  of  the  defining  criteria  of 
this  type  of  Information  specific  activity. 

This  section  Is  devoted  to  an  exam- 
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ln.it  I  on  of  various  profile  techniques  of 
generation  and  control.  The  manner  in 
which  the  system  internally  operates  on 
those  profiles  is  examined  in  a  later 
section  (matching). 

Vocabulary 

In  the  selection  of  the  unit  terms 
to  be  contained  in  his  Interest  profile, 
the  user  is  allowed  free  choice  in  less 
than  one  third  of  the  systems  surveyed. 

The  great  majority  of  systems  impose 
some  manner  of  controlled  or  prewritten 
vocabulary  from  which  the  user  must  make 
his  choice.  It  Is  of  interest  to  note 
that  in  the  free  choice  situations,  over 
half  of  the  systems  actua 1 1 y  operate  on 
the  user’s  selected  terms.  In  the  re¬ 
maining  systems,  the  user-selected  terms 
are  translated  into  some  form  of  con¬ 
trolled  vocabulary  that  the  system  uses 
I  nterna  My. 

Four  computerized  systems  use  free 
language  document  Input  and  profiles. 

The  ease  and  flexibility  of  document 
inputting  In  free  language  computerized 
system  Is  somewhat  overshadowed  by  the 
complexities  of  profile  construction. 

The  user  may  Include  any  term  he  wishes 
and  long  profiles  or  mgitl-prof I les  are 
encouraged.  However,  the  user  also  has 
an  obligation  to  Include  alternate 
spellings  and  forms,  e.g.,  ionization, 
ionisation,  Charles  R.  Sage,  C.R.  Sage, 
symbolic  and  alpha  chemical  formulas. 
Synonyms  and  related  words  need  to  be 
Included  also  and  are  generally  the  user's 
rosponsibt llty.  However,  SOI  personnel 
are  particularly  aware  of  the  profiling 
difficulties  and  are  prepared  to  give 
additional  aid. 

The  significant  majority  of  systems 
operates,  at  least  Internally,  on  some 
form  of  controlled  vocabulary,  or 
thesaurus.  These  systems  In  turn  are 
divided  with  two-thirds  using  Internally 
generated  special  purpose  thesauri  and 
one-third  using  a  published  thesaurus 
such  at  O.D.C.  (the  Defense  Documentatic'n 
Center  vocabulary),  MESH  (the  medical 


vocabulary  of  the  MEDLARS  system),  or  E.J. 
C.  (Engineers  Joint  Council  thesaurus  of 
engineering  terms). 

Construction  of  FrofUes 

Only  one  of  the  iyttems  surveyed 
practiced  "autoprof  1 1 Ing;"  l.e.,  con¬ 
structing  the  user's  profile  automatically 
from  the  user's  job  description,  project 
description,  library  request  records, 
etc.  This  concept,  a  popular  theory  In 
the  past,  has  apparently  not  gained  wide¬ 
spread  acceptance  In  practice. 

Construct  I  ng  profiles  •'egard  I  ess  of 
complexity  Is  a  difficult,  time  consuming 
task.  One  of  the  participants  Indicated 
that  most  of  the  time  users  do  not  know 
what  to  ask  for  to  produce  desired  re¬ 
sults.  Some  systems  have  employed  addi¬ 
tional  means  for  making  profiles  oper¬ 
ational  more  quickly.  One  experimental 
system  performs  matching  according  to 
each  user's  profiles  but  sends  alT  ab¬ 
stracts  to  the  users  for  comments  on  a  1 1 
for  the  first  few  SOI  runs.  Or^i  system 
asks  users  to  bring  prototype  documents 
that  reflect  user  Interests.  In  addition 
to  profiling  aids  most  systems  expect 
staff  members  to  spend  tine  aiding  users 
in  profile  construction. 

Three  surveyed  systems  have  removed 
the  profiling  task  from  the  users  al¬ 
together  by  presenting  the  user  with 
defined  subject  categories  each  broad 
enough  to  precipitate  notices.  After 
the  user  makes  his  selection,  systems 
personnel  construct  profiles.  One 
commerical  SOI  system  offers  users  In¬ 
dividually  tailored  profiles  and  this 
new  approach  to  profiling.  In  some  cases 
the  user's  cost  Is  cut  In  half  by  picking 
subjects,  and  there  has  been  e  1(3$ 
change-over  from  Individually  tailored 
profiles.  The  system  perhaps  more 
graphically  than  any  other  example 
Illustrates  that  users  believe  their 
Information  needs  can  still  be  met  with 
this  approach.  One  IBM  system,  celled 
SCAN  (System  34),  Is  the  forerunner  of 
these  systeam,  having  been  In  operation 
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for  three  years. 

U>rm»  par  F'rofUe 

As  to  the  profiles  themselves,  a 
1 1  file  more  than  half  of  the  systems  place 
no  restrictions  on  the  number  of  terms  a 
user  may  have  In  his  profile.  In  these 
sysf'’7«.  with  limited  term  profiles,  the 
upper  limit  varies  from  7  to  9999  terms. 

A  qualification  of  those  systems  with 
very  high  upper  limit  figures  (1000  or 
more  terms  ^sr  profile)  appears  when  they 
are  referred  back  to  the  type  of  vocab¬ 
ulary  being  used.  Systems  permitting 
open  (or  free)  choice  vocabularies  either 
have  very  high  upper  term  limits  or  no 
limits  at  all.  Controlled  vocabulary 
systems,  on  the  other  hand,  tend  to  pro¬ 
duce  very  definite  limits  with  a  range 
from  7  to  1000  terms  with  most  prof  I les 
(for  this  type  of  system)  containing  200 
forms  or  less.  For  all  systems  where  It 
is  possible  to  compare  the  average  number 
of  terms  per  profile  against  the  maximum, 
file  average  number  of  terms  is  less  than 
half.  For  systems  with  no  upper  limit, 
the  average  Is  30  terms  per  profile  or 
less.  Only  two  systems  average  over  100 
terms  per  profile*  and  In  both  cases  the 
average  Is  less  than  half  of  the  maximum. 
Since  profiles  average  considerably  fewer 
forms  than  the  maxiimw,  the  systems  are 
designed  with  larger  capacities  than  are 
currently  being  used. 

It  is  difficult  here*  as  In  the 
previous  sectional  analyses,  to  draw  very 
many  revealing  conclusions.  A  tendency 
toward  highly  Individualistic  designs 
becomes  Increasingly  apparent  with  the 
exception  of  “^he  understandable  system 
preference  for  Mtabilshed  or  "fixed" 
vocabularies. 

Use  of  Heights  and  Logic 

An  examination  of  the  systems' 
internal  medumtcs  as  regards  profile 
manipulation,  on  the  other  hand,  pro¬ 
duced  several  marked  uniformities  among 
the  study  population.  Over  two  thirds 
of  the  systems  utl  I  Ited  soma  form  of 


weighting  or  syntactical  logic  between 
terms  within  a  user  profile.  These  tech¬ 
niques  varied  from  a  simple  set  of  logical 
operators  (and,  or,  not)  to  scales  of 
interger  weights  with  ranges  as  high  as 
1000.  Those  systems  not  utilizing 
weighting  or  logic  treat  each  term  in 
the  profile  as  having  equal  Importance 
to  the  user  and  determine  relevancy  or 
degree  of  interest  by  the  number  of  terms 
matched  against  a  document.  See  the 
discussion  of  matching  in  Section  6  for 
a  detailed  discussion  of  matching  tech¬ 
niques. 

5.  Profile  Modification 

Most  or  all  of  the  SOI  user  popula¬ 
tion  will,  over  a  period  of  time,  experi¬ 
ence  changes  In  Its  areas  of  technical 
interest.  Thus,  to  maintain  a  relevant 
service,  the  SOI  system  must  be  flexible 
in  its  ability  to  alter  a  user's  interest 
profile.  This  section  of  the  study  Is 
directed  at  a  review  of  the  methods  the 
various  systems  employ  to  adapt  to  chang¬ 
ing  user  interests. 

All  of  the  surveyed  systems  permit 
the  alteration  of  user  profiles.  This 
appears  to  be  a  uniform  feature  of  all 
SDI  services,  and  variations  are.  encount¬ 
ered  only  in  the  means  and  frequency  of 
'modification. 

Over  65%  of  the  systems  studied 
altered  profiles  or  permitted  profile 
alteration  as  the  result  of  each  dis¬ 
tribution  of  user  notices.  Of  these 
systems,  those  in  the  experimental  or 
developmental  phase  relied  heavily  on 
personal  Interaction  (Interviews)  be¬ 
tween  the  user  and  system's  staff.  Those 
systems  In  full  operation,  on  the  other 
hand,  relied  almost  entirely  on  some  form 
of  routine  notice  by  the  user,  usually 
a  returnable  portion  of  the  user  notice. 

Approximately  20$  of  the  systems 
only  allowed  alteration  of  profiles  at 
fixed  calendar  intervals.  These  periods 
ranged  from  bimonthly  to  semi-annually. 
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The  rentiiininn  systems  (approximately 
yielded  a  variety  of  responses  to  the 
question  of  frequency,  although  the  im¬ 
plication  Is  clear  that  modifications  are 
accepted  at  the  request  of  the  user. 

These  systems  may  be  regarded  as  minor 
variations  of  the  set  which  permits  modi¬ 
fications  per  notice. 

Interestingly,  only  one  system  (Ames 
Laboratory,  USAEC)  Indicates  that  the 
profile  is  modified  automatically  by  tne 
systems  computer  program.  This  technique, 
os  with  autoprofiling,  is  another  instance 
of  a  popular  theoretical  concept  which  has 
experienced  only  limited  practical  accep- 
l.:nce. 

6.  Match  I ng 

Matching  is  the  focal  point  of  SOI. 
Profile  construction  and  document  Input 
are  done  In  preparation  for  matching 
users'  interests  against  new  documents. 

As  a  result  of  matching,  users  receive 
notices  of  relevant  documents.  Two 
questions  are  discussed  here: 

Question  I.  Method  of  matching 
against  documents 

Question  2.  Condition  necessary 
to  send  a  notice  to 
a  user 

Question  I.  Method  of  matching 
proM les  against  documents. 

The  four  combinations  of  matching 
user  profiles  against  document  profiles 
wero  offered:  Individual  or  aggregated 
user  profiles  against  Individual  or 
mKiregated  document  profiles.  "Aggregated" 
fflo.tns  that  all  terms  are  reordered  into 
a  single  master  list  so  that  no  term 
appears  more  than  once.  Using  aggregated 
profiles  takes  less  space  in  computers 
and  in  general  simplifies  linear  matching. 
However,  creating  a  master  list  necessi* 
tatos  sorting  operetlons  to  get  terms 
associated  with  the  proper  user  or  docu¬ 
ment  and  to  remove  duplicate  notices  for 
tho  same  document. 


The  participants  use  the  following 
approaches: 

24  Individual  user  profiles  against 
Individual  document  profiles. 

I  Aggregated  user  profile  against 
Individual  document  profiles. 

3  Individual  user  profiles  against 
aggregated  document  profile. 

4  Aggregated  user  profile  against 

aggregated  document  profile. 

6  Not  applicable. 

The  question  Is  not  applicable  to 
four  manual  systems  that  use  SDI  personnel 
to  determine  hits.  (The  other  three  man¬ 
ual  systems  have  formal  matching  strategy.) 
The  other  two  systems  for  which  this  ques¬ 
tion  Is  not  appllcabla  do  not  perform  SOI 
functions  in-house. 

One  of  three  matching  strategies  are 
used  by  most  systems:  linear  matching  (14), 
boolean  logic  (6)  and  weighted  terms  (9). 
Linear  matching  can  be  performed  effi* 
clently  using  any  of  the  four  approaches 
listed  above,  but  most  linear  systems 
prefer  the  first  choice.  The  first  choice 
seems  almost  a  must  for  systems  using 
boolean  logic  and  weights,  and  most  of 
these  systems  do  use  Individual  user  and 
document  profiles.  However,  two  systems 
using  weights  aggregated  document  records, 
and  one  of  these  systems  aggregated  both. 

Question  2.  Conditions  Ne<»ssary 
to  Send  Notices  to  Users. 

In  the  matching  process  It  ts  ne¬ 
cessary  to  determine  when  a  match  between 
user  and  document  ts  Important  enot^h  to 
send  the  user  a  notice.  This  level  of 
significant  match  Is  celled  the  threshold 
or  hit  level.  For  instance.  If  the  re¬ 
quirement  Is  that  at  least  three  terms 
In  a  user's  profile  metch  terms  in  e 
document  profile,  the  hit  level  Is  3. 

in  order  to  look  for  meaningful  re- 
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lationships,  the  systems  are  divided 
according  to  the  type  of  user  profile 
used:  linear  terms,  boolean  logic, 
weighted  profile  terms,  and  other.  A 
rally  of  answers  Is  presented  in  the 
chart  below: 


Inc  same  six  systems  excluded  from  the 
discussion  of  the  preceding  question 
are  excluded  here  also.  In  order  to 
see  some  of  the  different  approaches 
given  In  the  chart,  representative 
systems  will  be  explained. 

Fort  Monmouth  uses  linear  matching. 
A 1 1  user  prof  I le  terms  are  i ndependent 
and  equal  in  value.  Fort  Monmouth  allows 
users  to  use  negative  words  that  over¬ 
ride  matches.  One  match  is  sufficient 
for  a  user  to  receive  a  notice  of  the 
document. 

Boolean  logic  uses  the  boolean 
t^porators  and,  or,  and  not  to  connect 
user  prof  l  ie'  terms  into“T^lc  state- 
monts.  The  hit  level  Is  fixed.  Al¬ 
though  it  varies  from  system  to  system, 
it  does  not  vary  within  a  system, 
i  ither  the  document  terms  satisfy  the 
•••qlc  statement,  or  they  do  not.  One 
Interesting  approach  to  boolean  logic 
w.i*i  developed  at  Fort  Detrick.  A 
b<x)iean  threshold  of  10  is  fixed;  user 
profile  terms  are  '^weighted'*  in  such  a 
way  that  desirable  combinations  of  terms 
sum  to  10  exactly  and  a  notice  Is 
•lonerated.  if  the  weights  of  the  matched 
terms  exceed  or  are  less  than,  no 
ixitch  is  made. 

The  SOI  system  at  Ames  Laboratory 
is  an  unusual  system,  since  It  is  the 
only  one  that  varies  the  hit  level 
according  to  documents,  in  this  system 


user  profile  terms  are  weighted  between 
0.0000  and  0.9999  to  four  significant 
digits.  Most  systems  assign  hit  scores 
to  user  profiles,  but  Ames  assigns  a  hit 
level  to  documents  according  to  the  depth 
of  indexing.  Texts  have  a  hit  level  of 
0.7500;  abstracts  are  0.5000;  and  titles 
are  0.3000.  This  system  needs  this  sort 
of  approach  for  efficient  service  because 
of  the  varied  formats  for  document  input. 

Two  systems  in  the  ''Other”  group 
use  a  hlerarchal  indexing  system  from 
general  to  specific  subject  areas  for 
both  documents  and  user  orofiles.  A 
matet  occurs  when  a  document  code  matches 
or  is  more  specific  within  the  same  area 
as  the  user  term  code. 

The  other  system  In  this  last 

group,  Bonneville  Power  Admin 1st rat ’on 
uses  weights  and  boolean  operators  to¬ 
gether.  This  system  Is  one  of  the  three 
surveyed  systems  using  abstracts  or  text 
as  Input.  Using  abstracts  requires 
complex  profile  construction  and  matching 
technique.  Bonneville  uses  a  weighting 
scale  for  profile  terms  from  -9  to  -9 
and  8  hit  level  from  00  to  99,  but  In 
addition  other  features  are  Included. 
"Synonym  families"  simulate  "or”  In 
boolean  logic.  The  user  uses  a  code 
for  those  situations  where  one  word  Is 
as  good  as  another;  but  if  both  were 
to  be  found  In  the  same  document,  they 
would  count  only  once.  "Not"  In  this 
system  Is  not  weighted  and  serves  as 
total  negation  of  a  document  regardless 
of  other  matches.  "Must"  Is  also  not 
weighted  and  means  that  e  document  wl 1 1 
be  suppressed  unless  the  must  term  is  in 
the  abstract.  Weighted,  not  and  must 
terms  are  used  both  for  eMct  and  root 
terms.  Root  terms  match  with  all  docu¬ 
ment  words  beqionlnq  with  the  same 
letters  in  the  same  spacing  and  punctu¬ 
ation. 

The  most  significant  factor  In 
hit  level  varletion  Is  that  It  the  hit 
level  varies,  it  varies  user  by  user 
r-ther  than  document  by  document.  When 
li  >er  matching  is  used,  tne  systems 
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surveyed  are  fairly  evenly  divided  between 
the  several  options,  fioolean  strategy 
implies  a  fixed  hit  levei.  When  weighted 
profiie  terms  are  used  in  matching,  the 
hit  ievei  usual iy  varies  and  it  varies 
user  by  user.  One  system  using  weights 
adjusts  users'  profiies  so  that  aii  have 
a  hit  level  of  0.999.  Each  user  con¬ 
structs  his  profile  using  a  weighting 
scale  from  I.  to  9,  and  he  assigns  the 
proper  hit  levei.  Then  the  system  sets 
the  hit  level  at  0.999  and  adjusts  the 
weights  within  this  structure. 

7.  Notice 

According  to  the  SOI  concept,  SOI 
is  an  alerting  service.  The  point  of 
contact  between  the  system  and  the  user 
is  the  notice  of  new  documents  of  prob¬ 
able  Interest.  Notices  mark  the  end  of 
the  SOI  process. 

Questions  t  and  2.  Content  and 
form  of  user  notice. 

Question  3.  Frequency  of  dis¬ 
tribution. 

Questions  I  and  2.  Content  and 
Form  of  user  Notices. 

Within  system  limits  the  notice  is 
established  to  be  compatible  with  users, 
needs.  Since  SOI  systems  have  two  factors 
In  common  -  notice  as  only  the  point  of 
interface  between  user  and  system,  and 
notice  design  as  the  measure  of  user  in¬ 
formation  requirements  -  an  analysis  of 
user  notices  should  yield  a  typical  user 
notice.  Experimental  and  partially 
oporatlonal  systems  are  excluded  be¬ 
cause  the  bser  notice  is  one  of  the  para¬ 
meters  usually  under  development  and 
hence  does  not  truly  reflect  coordination 
of  user  needs  and  system  opacity. 

The  typical  notice  contains  an 
abstract  and  accession  number.  Systems 
including  classified  documents  in  the 
Input  print  citations  for  classified 
doruments  and  abstracts  for  unclassified 
material.  If  the  SOi  system  uses  eetghta, 
a  weight  factor  may  also  be  Included.  A 


return  request  form  is  included,  implying 
that  each  Installation  possesses  the 
document  is  cites.  Some  method  for  pro- 
f i  le  modification  May  also  be  Includ^. 

Although  the  content  of  a  notice 
can  be  generalized,  the  form  used  by  the 
surveyed  systems  is  varied.  The  moat 
coftOTon  characteristic  is  Itidt  the  notices 
are  machine  printed.  Since  most  of  the 
systems  In  the  survey  are  computerized, 
printed  notices  are  expected.  However, 
half  use  punch  cards  or  double  hinged 
punch  cards.  Some  print  on  computer 
paper,  and  some  use  cards  of  other  than 
punch  card  size.  One  system  uses  machine 
pr'nt-out  on  cards  for  all  information 
except  the  abstract,  which  is  xeroxed 
onto  the  cards. 

Only  thrae  fully  operational  systems 
de 1 1 ver  documents  as  a  notice.  An  earlier 
IBM  experimental  SOI  system  distributed 
copies  of  the  document;  but  as  the  number 
of  users  increased,  distributing  docu¬ 
ments  became  economically  tnadvisabis. 

In  general  user  comment  did  not  endorse 
document  distribution.  A  user  seems  to 
prefer  an  abbreviated  version  of  the  docu¬ 
ment,  particularly  when  a  system  has  a 
large  data  base.  Sometimes  Just  an  aware¬ 
ness  of  the  document  is  sufficient  so  that 
an  abstract  is  satisfactory.  Hoxever,  the 
function  of  each  system  should  be  consid¬ 
ered  before  final  Judgment  on  document 
delivery  as  the  notice  Is  made.  The 
three  systems  that  send  documents  initially 
are  designed  In  such  a  may  that  not  to 
deliver  documents  Is  Impractical. 

Question  3.  Frequency  of  Notice 
Distribution. 

The  twenty-five  operational  systems 
distribute  notices  In  the  following  man¬ 
ner: 

9  Daily 
9  Weekly 
6  Twice  Monthly 
9  Monthly 

Frequency  of  notice  does  not  Siam 
to  be  related  to  documents  per  run. 
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Hudqot  figure*;  aro  .jvai  table  for  only  ten 
•-V'.ffWi  in  this  group,  tor  these  systems 
Kjoro  is  no  relationship  between  money 
spent  on  Sf)l  and  the  number  of  runs 
made.  Frequency  of  notice  distribution 
depends  on  user's  need  for  current  lnfor~ 
mat  I on  and  systems  design  rather  than  on 
budget  or  document  volume. 

8.  Equipment  and  Personnel 
iguipment 

Utilization  of  fixed  resources,  as 
with  the  budgetary  data,  was  difficult 
fo  define  in  the  survey  responses.  The 
principal  reason  is  that  all  but  a  few 
of  the  SOI  systems  studied  were  periph¬ 
eral  functions  of  larger  systems,  sharing 
with  the  parent  system  Its  equipment, 
facilities  and  personnel. 

Proportions  of  equipment  and 
personnel  devoted  to  SOI  functions  not 
only  varied  widely  from  system  to  system, 
but  within  systems  from  time  to  time  as 
requirements  and  work  loads  varied. 

Ail  of  these  factors  combine  to 
defy  any  revealing  comparative  analysis 
and  to  obscure  correlations  between 
various  other  system  factors  such  as 
volume,  population  and  cost. 

Of  a  nuiM>er  of  quantifiable  factors, 
w4  found  that  approximately  73S  of  the 
synteme  surveyed  utilized  a  computer 
capability  In  one  of  numerous  forms  and 
levels  of  utilization. 

In  those  systems  reporting  personnel 
«i'>slgnments,  the  large  majority  (again 
ovor  used  six  or  less  persons  as 
permanent  staff.  Again  it  was  not 
possible  to  make  any  correlation  between 
manodiours  and  budget*  In  fact,  attempts 
in  this  area  lead  us  to  believe  that  at 
ie^st  one  of  these  two  factors  has  been 
reported  erroneously  In  numerous  In¬ 
stances. 

Physical  configurations  varied 
throughout  a  wide  rwige  of  sophistication 


from  manual  card  files  to  an  IBM  system 
360/50  with  a  fully  loaded  channel. 
Methods  of  source  data  automation  cen¬ 
tered  strongly  around  the  use  of  key¬ 
punching  with  two  Instances  of  the  newer 
technique  of  keytape  entry r 

One  of  the  interesting  aspects  of 
equipment  utilization  is  the  sharp  de- 
markation  found  when  progressing  from 
manual  to  automatic  systems.  There  is 
virtually  no  middle  area  of  semi-manual 
or  semi-automatic  process  (i.e.,  punched 
cards,  optical  coincidence,  etc.)  while 
the  remaining  were  either  entirely  manual 
or  ful ly  automatic. 

Another  Interesting  aspect  was  the 
relatively  high  occurence  of  multiple 
computer  utilization.  Fully  53f  of  the 
represented  systems  used  two  or  wore 
computers  in  various  processing  steps 
of  the  SDI  cycle.  Two  of  the  systems 
indicated  the  use  of  three  computers, 
and  one  system  listed  four.  In  the 
great  majority  of  dual  computer  use,  the 
second  machine  was  of  a  relatively  re¬ 
stricted  capability  (such  as  IBM  1401, 
Univac  1004)  and  used  primarily  as  an 
off-line  printer.  Diffusion  of  pro¬ 
cessing  steps  among  two  or  more  computers 
in  the  same  system  of  roughly  the  same 
capability  level  was  primarily  to  permit 
utilization  of  existent  software,  de¬ 
veloped  for  or  shared  by  other  non-SOI 
applications. 

The  above  summary,  and  particularly 
the  last  observation,  indicates  strongly 
that  SOI  is  a  relatively  low  priority 
operation  in  a  multipurpose  computer 
installation.  In  fact,  at  this  point 
in  the  survey,  the  SOI  concept  is 
emerging  as  a  precariously  established 
system,  developed  on  an  ad  hoc  basis 
and  existing  at  the  sufferance  of  higher 
priority  budgetary  and  resotirce  require¬ 
ments. 

Personnel 

Personnel  by  category  or  profes¬ 
sional  level  were  reported  in  very  nearly 
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all  systems  studied  and  permitted  still 
another  interesting  observation.  In  its 
earlier  form,  5DI  was  viewed  as  largely 
a  semi -professional  or  clerical  occu¬ 
pation.  However,  a  profiling  of  personnel 
engaged  In  this  activity  shows  that  for 
each  clerk  there  is  one  technician 
(indexer,  abstractor,  etc.)  and  1.25  full 
professional  analysts.  On  the  other  hand, 
1here  are  only  0.5  persons  engaged  in 
machine  operation  and  0,2  persons  listed 
as  programmers. 

Analyzing  those  profiles  by  system 
type  (e.g.  computerized  or  manual)  shows 
a  slight  shift  in  the  clerical  based 
ratio,  as  indicated  in  the  below  table, 
which  appears  to  indicate  that  the  In¬ 
troduction  of  a  computer  not  only  intro¬ 
duces  new  personnel  typos  (programmers) 
but  seems  to  increase  the  overall  staff 
as  well. 


A 1 1  Computer  Manua 1 

I'rwfesslonal  Analyst 

1.25 

1.4 

0.75 

Technician 

1 

l.l 

0.4 

lYogrammer 

0.2 

0.2 

0 

Machine  operator 

0.5 

0.6 

0.25 

Clock 

1 

1 

1 

The  temptation  to  bel 

lave 

that  this 

phenomenon  explodes  the  myth  of  the 
computer  as  a  labor  sav.ng  device  must 
give  way  to  piausable  explanation.  The 
increase  In  the  ratio  of  professional 
analysts,  for  example,  simply  reflects 
m>iridgement's  reluctance  to  leave  a 
si>phlsticated  and  higher  cost  system  in 
the  hands  of  technical  or  semi-profes¬ 
sional  personnel.  The  rise  In  machine 
operators  reflects  the  addition  of  such 
fMcupationt  as  keypunch/keyvert tiers,  and 
fhe  rise  In  technical  staff  follows  the 
computer's  ability  to  process  large 
vot'.mies.  A  more  meaningful  analysis  of 
course  would  be  the  ratio  of  staff  to 
users  but  again  no  correlations  are 
obtainable  from  the  surveyed  data. 


B.  THREE  SDIxSYSTEMS  IN  THE  UNITED  KINGDOM 

■■■'  '  %  '  I  II  Ill  ■!  1^—— — 

One  feature  of  this  state-of-the-art 
survey  Is  to  Include  foreign  SOI  systems. 
Unfortunately  documentation  on  SDI  systems 
outside  the  United  States  Is  not  as  nu¬ 
merous  as  that  for  U.S.  systems.  The  time 
necessary  to  do  a  thorough  search  was  un¬ 
available;  hence  the  three  systems  Included 
In  this  survey  should  not  be  considered 
as  a  representative  samp. te.  We  are  aware 
of  one  more  SDI  systeri  serving  a  steel 
Industry  based  In  Glascow,  Scotland,  and 
cited  In  ASLIB  Proceedings,  January  1965. 

Th 1 s  manual'  sysfem  Is  large .  Unfortunate  I y , 
we  were  unable  to  send  a  letter  to  this 
company  to  Invite  participation  In  the 
survey. 

The  three  systems  that  were  contacted 
responded  immediately  with  a  completed 
questionnaire  and  very  helpful  Information. 
Their  thoughtfulness  and  promptness  Is 
greately  appreciated.  These  three  systems 
represent  a  cross-section  of  types:  one 
commercial  system,  one  government  system, 
and  one  research  and  development  system. 

The  Scientific  Documentation  Centre 
based  in  Dunfermline,  Fife,  United  Kingdom 
Is  the  commercial  SOI  system.  The  system 
has  8  perticularty  large  document  volume, 
300,000  items  yearly,  from  e  wide  range 
of  sources  and  countries  covering  e  wide 
range  of  topics.  Documents  Include  ail 
of  U.S.  Government  Research  and  Develop¬ 
ment  Reports  and  appropriate  parts  of 
U.S.  and  British  Thesis  Titles.  The 
system  started  four  years  ego  solety  for 
SOI.  Gradual iy  the  topics  are  be'ng  made 
avB liable  for  retrospective  searches  also 
so  that  naif  are  included  in  a  storage  bank 
at  present.  Science,  technology,  medicine, 
botany,  zoology,  and  engineering  art  some 
of  the  fields  included.  Fees  very  accord¬ 
ing  to  topics  selected.  Most  topics  cost 
between  fifteen  and  fifty  pounds  per  year. 
Many  of  these  prices  are  less  than  Ameri¬ 
can  systems. 

A  potentiel  user  reviews  the  iist  of 
topics  available  and  indicates  his  chotoes. 
The  staff  at  the  Centre  construct  the  pro¬ 
file.  The  user  receives  weekly  notices  on 
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5  X  card  or  a  punch  card  according  to 
his  preference.  Each  card  contains  a 
citation  and  keywords.  The  user  may 
order  the  document,  but  ordering  docu¬ 
ments  requires  an  additional  fee. 

Specific  questions  on  users  and 
profile  construction  are  considered  confi¬ 
dential,  as  are  the  workings  of  the  sys¬ 
tem.  A  staff  of  forty,  mostly  students 
who  work  part-time,  maintains  this  manual 
SOI  service,  once  again  establishing 
that  manual  systems  are  no+  limited  in 
size  or  capabi I Ity. 

The  Culham  Laboratory,  United 
Kingdom  Atomic  Energy  Authority,  London, 
operates  an  SOI  system  both  for  internal 
and  external  users.  The  lab  has  always 
supplied  a  manual  SDI  service  and  until 
three  years  ago,  it  was  satisfactory. 
However,  the  number  of  users  increased 
until  It  became  impossible  for  the  infor¬ 
mation  staff  to  meet  the  needs  of  all 
users  adequately.  Experiments  were  con¬ 
ducted  that  resulted  in  a  fully  operation¬ 
al  system  being  implemented  a  year  and 
a  half  ago.  This  system  operates  with 
no  additional  staff,  a  minimal  amount  of 
additional  c lari  cal  support,  and  no 
Interruption  or  diminishing  of  regular 
library  functions. 

This  system  uses  document  titles 
as  Input.  When  titles  are  not  suffi¬ 
ciently  descriptive,  additional  words 
are  used.  The  average  number  of  terms 
per  document  is  twelve.  Profiles  are 
constructed  In  the  form  of  a  matrix 
with  ten  columns  and  thirty  rows,  allowing 
a  maximum  of  300  words.  The  ever.:^ 
prof  1 1 e,  however,  contains  only  20  terms. 
The  terms  are  connected  by  and,  or  and 
not.  This  SOI  system  operaTii  wTThout 
usTng  a  controlled  vocabulary. 

''he  computer  used  Is  an  English 
Electric>^leo-M8rcont  KDF9  with  a  random 
accMS  disc  file  of  about  four  million 
words.  Core  storage  holds  32,000  words 
and  magnetic  tape  flies  contain  atKMt  2 
million  words  pM*  reel.  The  Input  device 
Is  an  ULTfUNIC  811,  a  punchad  papar  tapa 
maenina. 


Personnel  involved  In  this  system’s 
operation  are  particularly  pleased  with 
the  number  of  relevant  notices  each  user 
.-eceives,  particularly  because  only  titles 
and  authors  are  used  as  input.  One  factor 
involved  In  this  high  precision  ratio  Is 
that  prior  to  sending  notices  to  users 
tne  Information  staff  reviews  each  user's 
notices  to  eliminate  Irrelevant  citations. 

There  are  200  users,  10  managers  and 
190  research  scientists,  who  receive 
notices  twice  weekly  If  users  are  In- 
house  and  once  weekly  if  the  users  are 
external.  Document  flow  is  150  per 
week  or  about  650  per  month.  Input  time 
per  document  is  5  minutes  and  Includes 
preparing  the  paper  tape.  The  subjects 
included  In  SOI  are  plasma  physics, 
controlled  thermonuclear  fusion,  ultra 
high  vacuum,  cryogenics,  super-conductici ty, 
lasers,  microwaves. 

The  Laboratory  Is  successful  In 
effecting  smooth  changeover  from  manual 
to  computerized  SOI  service  without 
overburdening  the  staff  or  Jeopardizing 
other  library  functions. 

The  third  SDI  system  was  developed 
as  a  project  to  lest  the  value,  cost,  and 
acceptance  of  a  selective  dissemination 
of  Information  system.  The  project  is 
expected  to  last  four  years.  For  the 
first  two  years  the  National  Electronics 
Research  C^ncil  was  in  charge.  The 
primary  task  was  to  establish  a  vocabulary 
and  indexing  procedures.  The  vocabulary, 
patterned  after  the  Engineers  Joint  Council 
thesaurus  of  engineering  terms,  contains 
1800  words  and  grows  at  the  rate  of  approx¬ 
imately  so  terms  per  week.  That  rata  of 
growth  will  sharply  diminish  after  the 
system  has  been  (^eretlonal  for  awhile. 

The  project  hat  been  transferred  to  the 
Institute  of  Electrical  Englnaers  who 
received  a  grant  of  61,350  pounds  for 
27  months  to  continue  the  project. 
lEE's  task  is  to  sal act  600  alectronics 
research  workers  from  universities.  In¬ 
dustries,  and  government  and  oonstruct 
users'  profiles.  In  the  last  year  of  the 
project  the  system  Is  expected  to  be 
fully  operatlonel  and  to  distribute 
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uruup  I 


( 

A 

B 

C 

( 

jBudgel  per  year 

$l»500 

$l»8.000 

$20,000 

Users  [Profiles! 

15  [?] 

38  [38] 

5  [111 

Documents  per  year 

<  , 

6000 

48,000 

4800 

Runs  per  year  ^ 

26 

^  i 

12 

— 

1 

Cost  per  user  [Profile] 

$300  [?] 

$1265  [$12651 

$400  [1818] 

Cost  per  document 

$0.75 

$1 

$4.17 

Cost  per  run 

$175 

$8000 

$1667 

Documents  per  run  [per  user]  . 

230  [151 

8000  [205] 

400  [80] 

Cost  per  user  per  run 

$12 

$210 

$333 

Table  II.  Experimental  and  Partially  Operational  Systems 


Group  1 1 


j Budget  per  year_  _  _ 

_ 

Oociywents  pe r  ytsr^  _ 

! 

;  B*'’  X®*JC _ 

*  Cost  per  user  [Profi le] 

;  i 

tost  pftr  docMmnt  .  .  „  , 

tgst  per  run_ . . i 

i 

i  j^uments  per  run  [per  user)  | 

icost  per  user  per  run  ■ 


$3000 

$8000 

.  U»-  LJ.L41  _ 

103  [1031 

16.000 

1500 

26 

'  26_ 

$26  [$26] 

$78  (S78J 

$5.33 

$115 

$308 

385  [3.4] 

58  [0.6] 

$1.00 

Table  III.  Seal  I  $01  Systems 
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1 — »  ^ 

_ B _ 

C 

D  __ 

Budget  per  year 

HREH 

$30,000 

mm 

260  [2601 

800  [1500] 

68  groups  [3861 

Documents  per  veer 

IH 

3000 

6000 

6000 

26 

12 

26 

52 

$25  [$25l 

i 

$96  [$96]  1  $38  [$20] 

$660  [$781 

Cost  per  document 

$0.15 

$8.35 

r 

$7.50 

$5.00 

Cost  per  run 

warn 

$2085 

$<155 _ 

$575 

2500  [6251 

IHnSfl 

mmm 

MRIilliPlllHi 

$1 

$8.00 

$1.65 

$8.65 

Medium  SOI  Systems 


Group  IV 


K 


^doet  per  veer 


$»S.000  '$$0.000 


industries  [12] 


$65.000 


$130.000 


$180.000 


users  [Profiles} 


220  [2203 


200  [7001 


2000  12000} 


700  m 


Pocuisents  per  veer 


86  .<100 


30.000 


62L 


22.000 


kuns  per  veer 


52 


SL 


12 


12 


220 


Cost  aer  user  [Prof i ttl 


$206 [$2061 


$6165 

■Ivni  II 


m.  am 


$255  [?] 


Cost  per  document 


iOJ3 


10J8. 


$2,15 


$218.22 


$8.18 


Pit  per  run 


m. 


J560. 


■uo-jag. 


$820 


bociiwMits  Jif  funtptr  Mserl6700  [301 


1660  [1601 


2500  [121 


52  [0.031 


JOOJOJI 


Cost  per  user  eer  run 
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L*JSL 


$5.60 


$1.20 


Teble  V.  Urge  SOI  Systems 


C.  COST  ANALYSIS 


wi  <  I- ly  nolic.x,.  Ih('  (ompuTor  '’.elected 
l<ii  thi'.  '.yslem  I'j  tin  Lnglish  Lloctric 
I- 1)1 

L.irly  in  I9(>H  the  first  SDl  runs 
will  1)0  m)ide.  A1  the  time  of  contact 
tii.itiy  questions  were  unanswerable  because 
()t  i)(  O'lunis  will  be  determined  by  practice. 
Ih.  index  inq  <ind  prof  i  ling  vocabulary  and 
itidi  xinq  techniques  are  fairly  well  os- 
I  r.  lishod.  However,  the  vocabulary  is 
.  •i-'i  led  to  grow  rapidly  for  awhile. 

I  Id.  .  inont  volume  is  expected  to  be  240 
(1.  I  uiiH-n1s  per  week  nr  about  lOOO  per 
iiidiilti.  lU'iwevor,  input  time,  percent 
dt  d. H.uments  .jccepted  for  SDl,  and  the 
IV' I  . Kid  number  of  terms  per  docuniont  are 
iii.iiiown.  r.u'h  user  provides  a  written 
'.i.iii'ment  of  interests,  and  staff  members 
if  iir.loto  it  into  a  user  profile  with  an 
iiiiP'M  limit  of  100  terms  connected  by 

i. i'ji.  operators.  lt)o  average  number  of 

is  unknown,  f’rofiles  will  be 
moil  i  lied  according  to  each  user's 
IS  .I'ssrtK'nt  of  his  notices.  He  will 
idii'lve  a  citation  and  index  terms  for 
d.n  h  relevant  documt>nt.  The  staff  and 
tti'>  (omputor  ant  available,  but  the 
amount  of  timet  that  will  bo  spent  on 
•ill  function  is  unknown. 

Since  this  project  is  a  test  of 
s.'i.  tivo  dissemination,  much  more  time 
,  ,i:  i‘i*  spent  .in.ily/lnq  all  the  parts  of 
III  ill  system  than  most  Installations 
II  illow.  Iho  final  opwnting  cost  and 
Id.  si(ii|ijes  of  this  system  will  certainly 

ii.  inldie'-.l  Ing  .ind  worthy  of  ln>depth 
si  iiy  after  the  system  ftecomes  fully 

!  it  tonal , 


We  found  that  Interpretation  of  SOI 
cost  varies  from  system  to  system,  pri¬ 
marily  because  SOI  is  a  subsystem.  For 
Instance  salaries  of  staff,  mailing  or 
distributing  notices,  cost  of  duplicating 
documents  or  appropriate  parts,  and  cost 
of  document  acquisition  are  some  of  the 
factors  that  may  be  Included  as  part  of 
the  cost;  but  whether  they  are  Included 
depends  on  each  installation's  budget 
policy  and  interpretation  of  SOI. 

Fourteen  systems  In  the  survey 
supplied  a  yearly  budget  figure.  The 
systems  were  divided  into  four  groups 
that  yeild  some  measure  of  compatibility 
within  each  group.  Most  of  tha  experi¬ 
mental  and  partially  operational  systems 
are  grouped  together  because  document 
volume  and  users  are  considerably  fewer 
than  each  system  plans  to  Include  In  a 
fully  operational  system.  Hence  relation¬ 
ships  between  cost  and  other  factors  are 
not  stable.  Small,  medium,  and  large  or 
commercial  systems  art  the  three  remaining 
groups. 

Group  I.  Experiments;  and  Partially 
Qparatlonal  Systems 

System  A  $  4,500. 

System  B  S  48,000. 

System  C  $  20,000. 

Group  11.  Small  Systems 

System  A  S  3,000. 


System  B  $  8,000 
Group  Ml.  Medium  Systems 
System  A  S  10,000. 

System  6  $  25,000. 

System  C  $  30,000. 

System  0  S  30,000. 
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( '.roup  1 V . 

1 

.irf)o  r]fi(J/or  Gommert 

f 

.y'.  lom57 

System 

A 

$ 

45,000. 

System 

B 

$ 

50,000. 

System 

C 

$ 

65,000. 

System 

D 

$ 

130,000. 

System 

t. 

$ 

180,000. 

The  f.ictors 

used  i n 

this  cost 

Ilia  lysis  are  document  volume,  number  of 
u'.ers,  and  number  of  SOI  runs  per  year. 
I.ibles  listing  the  calculations  used  for 
cost  analysis  are  found  within  the  next 
several  pages.  Question  marks  indicate 
l.ick  of  information. 

The  three  systems  in  Group  I  (Table 
II)  i I lustrate  that  SOI  systems  begin 
with  limited  Inputs.  Two  experimental 
‘.ys terns  have  large  enough  SOI  budgets 
to  be  in  Group  IV,  and  both  probably  will, 
once  they  become  fully  operational.  One 
of  these  uses  38  of  500  eligible  users. 
Currently  the  system  uses  National  Library 
•  •I  Medicine  MtOLARS  tape.  The  magnetic 
t.ipe,  issued  monthly,  contains  approx i^ 
Miitoly  fiOOO  citations.  However,  this 
installation  makes  an  SOI  run  every  other 
month  using  the  tape  for  one  month  only. 

Ihe  system  will  eventually  operate  twice 
a  month,  and  the  sources  for  input  will 
ho  Increased  to  Include  other  tape  ser¬ 
vices.  Hence  the  number  of  users  will 
increase  more  than  ten  times;  the  number 
of  SOI  runs  will  increase  four: times; 
and  the  document  tovei  wl 1 1  mote  than 
double  when  the  system  becomes  fully 
operational.  The  cost  per  user  for  each 
run,  presently  $210,  will  be  reduced  to  ^ 
approximately  $5  it  the  budget  remains 
constant . 

'F  •  '  ' 

Arwther  system  In  tftls  group  Is 
running  an  experiment  to  establish  an 
alertiiig  service  for  those,  involved  in 
il  I  abates  rem'arch.  The  source  Is  fixed;  i 
a  seloctlon  ii^  Ml  dinars*  Is  belfi^  used. 

I  lie  doctxaenf  Wlume,  AOO  documents  per 


run,  reflects  the  documents  selec^'ed  for 
the  five  ust  .  ather  than  the  dov-uments 
found  in  the  Diabetes  Literature  Index. 
the  portion  of  MEblANS'  ct tat tons.  This 
system  will  become  operational  in  a  year. 
The  cost  per  user  is  expected  to  be  greatly 
reduced,  since  there  is  a  large  population 
available  to  use  this  service. 

The  cost  per  user  for  each  SDI  run 
at  the  third  system  in  this  group  ($12) 
is  much  less  than  both  of  the  other  two 
experimental  systems.  However,  the  cost 
is  high  when  compared  to  operational 
systems.  This  system  is  using  only  15 
to  400  eligible  users  and  has  selected 
120  journals  for  the  test  collection. 

These  experimental  systems  do  not 
spend  money  directly  on  the  user  as  the 
fully  operational  systems  do.  Instead 
money  Is  spent  for  testing  approaches 
to  document  selection  and  Indexing, 
profile  construction  and  vocabulary  con¬ 
trol,  matching  technique,  and  for  testing 
and  debugging  equipment.  When  a  system 
is  fully  operational  the  cost  per  user 
Is  significantly  less  because  the  money 
spent  for  an  SDI  service  Is  more  directly 
channeled  into  sending  notices  to  users. 

Group  II  contains  two  small  systems. 
One  of  these  Is  a  partially  operational 
system  but  Is  Included  here  because  it 
Is  a  stable  perfected  system.  It  Is 
presently  mannually  operated;  but  after 
the  system  becomes  computerized,  the 
number  of  users  will  be  Increased.  The 
computerized  approach  will  operate  in 
the  same  manner  as  the  system  does  now. 

This  system  has  done  some  careful  cost 
analysis  bassd  on  notices.  The  total 
cost  for  each  summary  Is  20<  and  includes 
Indexing,  clerical  operations  and  supplies. 
With  the  present  number  of  users.  Indexing 
accounts  for  nearly  half  the  cost, 
indexing  time  should  remain  nearly  con¬ 
stant;  hence  as  the  number  of  scientists 
increase,  the  cost  per  user  will  decrease. 
For  the  year* s  trial  run  about  6,0(X) 
summaries  were  sent.  Each  usar  avaraged 
about  60  summarias  a  yaar,  for  a  yaarly 
cMt  per  user  of  $12.00.  Or.  Schneider, 
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•.y.li'iir.  ‘  1' i  •  iii<  t  ,  lo  l)f'  iiblo  jo 

||l•liv<■t  |(K)()  iM'liit-',  |)('(  yfiiir  tf)  Oiich 

icii I  i I  lor  $101)  (104  por  notice)  when 
III. •  •.y.O'ni  li.i*.  fhn  propor  numbor  of  users 
l A|>pn>xim.!foly  1000).  The  yenrly  cost 
(>oi  u‘.or  we  found  in  this  survey,  S?B, 
i'.  cons  i<  lor  ably  higfier  than  Dr.  ‘jchfiTdor's 
r.ih  iil.ited  cosi.  We  do  not  know  where 
Ihi*  d i  I  I ennires  arise. 


I  lie  oilier  syslom  operates  with  tne 
'.millesl  '.Dl  budget  in  the  survey. 
Allhoutjh  this  ‘‘.ystem  has  fewer  users 
tli.m  most  of  the  other  ten  systems  in 
(.ri.up  II  -  IV,  it  has  more  documents 
flrin  half  of  them  and  more  documents  per 
user  than  six  of  them.  This  computerized 
‘■ystem  uses  froe  language  and  Indexes 
.IfM  unK5nts  to  a  depth  of  10  terms.  No 
■  idili  I  ional  Information  is  available  to 
.•x(i|.iin  how  this  system  operates.  How- 
•  'V.  T,  III*'  cost  per  user  for  each  SOI  run 
i‘.  loasoriable  and  compatible  with  other 
•■y.tems. 


('•roup  III  contains  medium  size  SOI 
v  .lrms.  T‘te  dtxrumont  f  low  of  System  A 
ti  t),(K)0  a  year)  and  tho  cost  per  user 
ii.i  each  SDI  run  of  System  0  are  inter* 
I'.iiiK).  Only  two  other  fully  operational 
•.v.Uw.  equal  or  oxi ood  the  large  document 
in;.uf  par  run  of  System  A.  This  system 
•■ub  .<:r  lti«  s  to  Defense  Documentation 

i .  iitoi  for  document  input  In  the  form  of 
m.i')iM>tlr  tape.  The  Denter  handles  all 
•i.’iPiiso  .nnd  defense-rolated  documents; 
b.-iHf  diM  umont  supply  Is  large.  The 

■  idv.iiil.Kp!  of  rocelvlwj  ready-to-run 
.i.HittniMil  input  shows  in  cx>st  of  the 
.y.lurnt.  System  0  user,  group  profiles. 

ii. >-  •.y.tom  ‘.orvos  tho  majority  of  the 
fitx)  iTfi«)inoers  employed  there  by  esteb* 
li  hiiu|  profiios  for  68  groups.  The 
tia*iii<or  of  real  users  brings  the  cost 
•liwn  lo  approximately  $1  per  user, 
d>‘pon«t!n(|  on  how  imtny  people  use  SOI. 

A  .list  of  around  $1  por  user  ter  oech 
•in  run  I*,  t^nmpatlhle. 
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APPENDIX  A 


SDI 

QUESTIONNAIRE 
(Revised  for  ComsMterised  ^sterns) 


Office  of  Scientific  end  Teclmicei 
Information 

Office  of  Aerospace  Research 
United  States  Air  Force 
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1.  GENERAL  CONSIDERATIONS 


1.  What  is  the  name  of  your  organization?  ___________________________________ 

2.  Briefly  how  does  your  organization  define  the  SDl  concept?  __________ 

3.  What  is  the  formal  title  of  your  SDI  system?  _________________________ 

4.  How  long  has  your  SDI  system  been  in  operation?  years  months. 

5.  What  is  your  current  budget  for  operation  of  your  SDI  system?  • 

6.  Is  your  system  part  of  library  or  larger  information  center?  ■ 

7.  Is  your  SDI  system 

I  I  fully  operational 

t 

P]  partially  operational 
n  experimental 

6.  If  a  copy  is  available,  please  forward  a  system  manual.  Please  indicate 
if  you  would  like  it  returned. 

U.  USER 

1.  Is  population  limit  of  Ih4*  Hystem 
Q  Fixed? 

If  yes,  what  is  limit?  __________ 

pi  Open-ended 

What  is  population  size  at  present?  __________________ 
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USER  >  continued 


2.  WKat  is  population  composition? 

managerial 

— — .  ®**>*>f 

3.  How  many  people  are  eligible  users?  ______ 

4.  What  is  the  level  of  distribution?  _______ 

□  Individual 

r*^  Groups  related  by 
r~|  Discipline 
□  Administration 

Q  Task 

UL  DOCUMENTS 

4 

1.  Are  any  documents  indexed  belore  you  receive  them? 
If  Yes 

a.  From  ud«t  soarce(s)? _ 


h.  What  percent  of  total  documents  are  previously  indexed?  % 

2*  What  is  average  input  time  (indexing,  machine  coding,  and  bibliographic 
dotation)  per  document?  ' 

3.  At  what  rate  are  documents  processed?  _____  p«r  ________ 

4.  Of  the  documents  you  review,  approximately  what  percent  are  accepted 

by  the  syetem?  % 


IKK:UMKNTS  -  continued 


5.  Do  you  u*e 

Free  indexing  (without  benefit  of  controlled  vocabulary) 
Controlled  index  vocabulary 

a.  What  is  its  source? 

□  Internal 

D  External.  Name 

b.  How  many  terms  are  in  vocabulary  at  present? 

c.  At  what  rate  do  you  add  terms  to  vocabulary? 

6.  Is  the  number  of  index  terms  per  document  limited?  ________ 

If  Yofi.  lower  limit  _______ 

u|^r  limit  _______ 

7.  What  is  the  average  number  of  index  terms  per  document? 

8.  How  many  documents  are  one-time  publications  (technical  reports* 
bookSk  etc. )? 

How  many  one-time  publications  are  internal*?  ________ 

How  many  are  external?  ________ 

9.  How  many  documents  arc  journals  and  periodicals?  ________ 

Mow  many  journals  and  periodicals  are  internal*?  . 

How  many  are  external?  . 

10.  Is  any  indexing  performed  <mi  a  contract  basis?  ________ 

1 1 .  i'leasc  list  general  subject  areas  of  documents  tneluded  la  your  801  qrstein 

*  Generali*d  within  psrent  orgaaisMion 


IV.  PROFILE 


1.  Does  user  select  his  own  profile? 

If  Yes,  does  he  use 

n  Free  selection 
r*|  Controlled  vocabulary 
What  is  its  source? 

□  Internal 

rl  External  Name 
If  No,  briefly  describe  profiling  process 

2.  How  many  profiles  per  user  are  acce]^ed? 
How  many  profiles  are  now  in  the  system? 

S.  Is  the  munber  of  terms  per  profile  fixed?  ^ 
If  Yes,  lower  limit 
upper  limit 

4.  Do  you  use  a  weighting  system?  . 

If  Yes,  what  is  your  scale? 


5.  Caa  the  user  override  his  profile  and  select  automatic  distributioa 
of  epecific  aidhors.  Journal  titles,  corporate  authors,  etc.  ? 

6.  Caa  user  request  scaaaing  (moaitoriag)  of  specific  sources? 

7.  What  Is  the  averags  aaafier  of  terms  psr  laroftle  ? 
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V.  PROFILE  MODIFICATION 


1.  What  is  your  system  for  profile  modification? 
n  Returnable  notice  or  portion  of  notice 

i  2  Interviews  or  questionnaires  j 

How  frequently?  _ 

j  I  Other.  Please  explain. 

2.  How  often  do  you  modify  profiles? 

3.  Are  profiles  modified  by 

□  Adding  or  deleting  terms 
r~l  Modifying  weights 
r~]  Both  of  above 

4.  If  you  use  a  weighting  system,  how  are  weights  modified?  (Include  boolean  log: 
n  by  the  user 

If  user  changes  weights,  does  he  see 
1  I  list  of  matched  terms 
n  list  of  document  terms 
pi  other 

n  by  the  systom/please  explain.  ■ 


P 
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VI.  MATCHING 


1.  Matching  is  performed  by  oomparing 

r~|  individual  interat  profiles  against  individual  document  profiles. 
PI  aggregated^interest  profile  against  individual  document  profiles. 
n  individual  interest  profiles  against  aggregatedf* document  profile. 
n  aggregated  interest  profile  against  aggregated  document  profile. 

2.  Does  the  threshold  level  (point  below  which  m  notification  is  made) 
vary?  _________ 

a.  Who  initiates  threshold  modification 
n  user 

r~l  system 

b.  Is  the  threshold  varied 
n  document  by  document 
n  user  by  user 

c.  How  is  the  threshold  computed?  Please  explain  briefly. 


*  Combining  all  terms  into  a  single  master  list  of  u  rms. 

Vn.  NOTICE 


1.  Does  user  receive 
n  Citation 
n  Abstract 
P!  Accession  number 


n  Weight  or  match  fact  or 
D  matched  terms 

r~l  Document  index  terms 
pi  Return  request  form 
n  Profile  modification  form 
ri  Other 


Please  check  all  that  are  applicable 


n  □  n  □  n  n  LI  □  1 1 


NOTICE  -  continued 


2.  In  what  form  is  the  notice? 

Machine  print >out 

Pre -printed  form  (by  machine  or  manual  sources) 
Punch  caid 
Library  card 
Document 
Microfi  che 

Copy  of  table  of  contents 
Copy  of  cover 
Other  _ 


check  all  that  are  applicable. 
3.  How  often  do  you  issues  notices? 


Will  you  pl<'a.s<'  attach  a  copy  of  your  notice  form  if  it  la  available. 


VIU.  INTERNAL  SYSTEM  CONFIGURATION 


1.  Components 

a 

List  all  central  and  peripheral  EDP/ADP  equipment  used  in  the  operation 
of  ^our  SDl  system. 


Type /Model 


Time  Allocated  to  SDl  (in  hou 
Input*  Matching**  Output*** 


INTERNAL  SYSTEM  CONFIGURATION  -  continued 


2.  Personnel 

list  all  professional,  technical,  clerical  personnel  used  in  the  operation 
of  your  SDI.  system.  Indicate  contract  personnel  where  applicable. 


Time  Allocated  to  SDI  (in  hours) 

Job  Title  Input*  Matching**  Output*** 


*  Profile*  modi0cation  and  document  input 

**  Document  •  user  comparison 


***PrintinK  and/or  issuing  notices 


Ai»l>KNl)IX  B 


Participating  Organizations 


vNumbcrs  were  assigned  for  the  survey 
and  appear  throughout  the  report) 


(loytM-nment,  Private  Institutions,  and  University  SDI  Systems 


(Government:  14  instuUntions,  15  systems 

1.  II,  S.  Army  Kleetronics  Command,  Fort  Monmouth,  New  Jersey 

2.  II.  S.  Naval  Wcapon.s  Lalx)ratory,  Dahlgren,  Virginia 

:t.  II.  S.  Navy  thiroau  of  Ships  Technical  Library,  Washington,  D.  C. 

1*.  li.s.  Army  Natick  Laboratories,  Natick,  Massachusetts 
12.  Niielear  4Sufety  Information  Center,  U.  S.  Atomic  Energy  Commission, 

Oak  Uidge,  Tennessee 

It.  Fort  Ik'lriek  Technical  Information  Division,  Frederick,  Maryland 
15.  Harry  Diamond  Uilmratorics,  11.  S.  Army  Research  Office,  Washington,  D.  C. 
17.  Ilonneville  Power  Administration,  Portland.  Oregon 
|H.  II.  M.  Ikircau  of  Ri'clumation,  Denver,  Colorado 
'20.  Ames  l,iilK>mU>ry ,  U.S.  Atomic  Energy  Commission,  Amea,  Iowa 
'20..  27.  Nation.al  Aeronautics  and  Space  Administration,  Washington,  D.C. 

2H.  Natitmal  Cancer  Institute.  National  Institutes  of  Health,  Bethesda, 

Maryland 

20.  Foreign  Technology  Division,  Wright- Patterson  Air  Force  Base,  Ohio 
.‘1.5.  f.S.  Army  Mobility  Equipment  Research  and  Development  Center, 

Fort  Ik'lvoir.  Vii*ginia 

IM  ivaie  lns>iluU<ios:  21  installations,  22  systems 

*  I.  .s«>ari‘h  Information  hervice.  John  Crerar  Library,  Chicago,  Illinois 
d.  Aineri(*an  ('vancmid  Comp:my.  Pearl  River,  New  York 
7.  A|)pHi*<i  l%sios  Lbhoralory  Document  Library  found  at  Johns  Hopkins 
linlversiiy.  Silver  Stiring.  Maryland 
s.  ik’ll  I'c'lephonc  Laborutorics,  Inc. .  Murray  Hill,  New  Jersey 
to.  Aerithisiee  Research  Applications  Center,  Bloomington,  Indiana 


11.  Curtcr-Wallucc,  Inc.,  Cranberry,  New  Jersey 
13.  VITRO  Laboratories,  Silver  Spring,  Maryland 

16.  Naval  Reactor  Information  Center,  General  Electric  Co.,  Schenectady, 
New  York 

19.  Merican  Cyanamid  Conpany,  Bound  Brook,  New  Jersey 

21.  Health  Research,  Inc.,  Roswell  Park  Computer  Center,  Buffalo,  New  York 

22., 23.  Eastman  Kodak  Co.,  Rochester,  New  York 

24.  Western  Electric  Co.,  Princeton,  New  Jersey 

25.  Eli  Lilly  §  Company,  Indianapolis,  Indiana 

30.  IBM,  Owego,  New  York 

31.  IBM,  Armonk,  New  York  [corporate-wide  system] 

33.  Martin  Marietta  Corporation,  Orlando,  Florida 

34.  IBM,  Poughkeepsie  and  Kingston,  New  York 

36.  Douglas  Aircraft  Company,  Santa  Monica,  California 

37.  B.  F.  Goodrich,  Avon  Lake,  (Kiio 

38.  M  8  T  Chamicals  Rahway,  New  Jersey 

39.  Aero-Jet  General  Corporation,  Azusa,  California 

University:  1  installation,  1  system 

32.  Western  Reserve  University,  Cleveland,  Ohio 
SDL  Systems  in  the  United  Kingdom;** 

33.  Scientific  Documentation  Centre 

34.  Culham  Laboratory 

35.  Institute  of  Electrical  Engineers 


*These  systems  supply  SDI  services  to  other  industries,  government 
agencies  and  universities  either  as  part  or  all  of  their  function. 

**  These  systems  have  not  been  included  in  the  statistical  tabulation. 
Thoy  are  discussed  in  Chapter  I-P. 
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Detailed  Tabulation  of  Data 
IRcsponses  to  SDI  Questionnaire] 

llie  following  several  pages  contain  participants'  responses  to 
nearly  all  questions  from  the  survey  questionnaire.  Sene  questions 
arc  omitted  because  they  do  not  supply  meaningful  information. 
Explanations  supplied  by  participants  will  be  abbreviated  and 
included. 

Althreviations  used  on  the  following  pages  are  derived  from  the 
questionnaire,  and  it  is  hoped  that  they  have  been  selected  with 
care  in  order  to  be  obvious  when  the  questionnaire  is  referred  to. 
Although  the  reader  may  experience  some  inconvenience  while 
interpreting  abbreviations,  in  the  interest  of  conservation  of 
space,  it  seemed  as  u  best  approach. 

Some  rospondenco  failed  to  supply  answers  for  three  reasons: 
the  information  was  private;  the  information  was  unknown;  or  a 
quest ii>n  was  misunderstood  and  its  answer  was  incorrect.  Missidg 
answers  arc  coded  with  "?"  regardless  of  the  reason  for  omission. 

If  the  luestiori  is  not  applicable,  then  "N/A."  will  be  recorded. 


\.  Time  [in  years]  S,  Yearly  Hiidtfot  [in  thousands]  7.  Level  Of  Operation: 


0.75 

$10 

Fully 

0.67 

FrM  *  2  clerical  hrs/ao. 

Partial 

2.5 

? 

Experiment 

15.00 

$50 

Fully 

0.40 

9 

Experiment 

4.00 

$3 

Fully 

1.50 

? 

Partial 

0.75 

• 

Experiment 

5.00 

9 

Fully 

4.00 

$180 

Fully 

1.75 

$30 

Fully 

1.75 

9 

Fully 

2.40 

$48 

Partial 

0.25 

$4.5 

Experiment 

6.40 

$25 

Fully 

1.25 

$30 

Fully 

4.20 

$130 

Fully 

0.40 

$60  *  computer  rental 

Experiment 

2. SO 

$45 

Fully 

2.00 

? 

Fully 

1.40 

9 

Experiment 

0.40 

9 

Experiment 

4.00 

? 

Fully 

0.75 

$72  ♦  computer  rental 

Fully 

5.75 

? 

Fully 

1.00 

♦ 

• 

Experiment 

0.67 

$8 

Experiment 

5.00 

9 

« 

Fully 

s.oo 

9 

Fully 

2.75 

Fully 

1.00 

$60 

Experiment 

6.00 

$65 

Fully 

3.00 

? 

Fully 

2.67 

? 

F  [SDI  .rtlyl 

S.50 

? 

Fully 

0 

0 

F,  inactive 

17.00 

? 

Fully 

4.50 

? 

Fully 

SO 


II. 


l!.s<*r 


1. 

|)o|tiil:il ion  liinil 

•) 

Mana^rrial 

Pop  u 1  a  1  i  on 

0  m  p  0  ,s  i  t  i  0  n 

Other 

1.  Number  of  U8< 

()|M‘n 

75 

325 

400 

•  > 

()|K'n 

Both 

300 

:t. 

()|M‘n 

10 

0 

10 

1. 

()|)on 

12 

0 

12 

<1. 

<  )[HMI 

Both 

20 

7. 

OjHIl 

0 

114 

114 

•S. 

()|H-n 

Both 

800 

!). 

Fixed,  2ri  li.sofb 

0 

25 

25 

10. 

()p<'n 

Both 

78 

1  1. 

OlWMl 

.‘55 

665 

700 

12. 

OlMMi 

240 

560 

800 

i:5. 

0|M'n 

15 

1385 

1400 

1  1. 

())  ion 

1 

37 

38 

lii. 

Fixi'd,  Ui  Users 

Both 

IS 

10, 

Open 

0.5 

195 

260 

17. 

Fixed.  iOiio  Fsers 

0 

68 

68  Contacts 

In. 

(Ipeii 

too 

1600 

2000 

1!). 

<  )|  lt  d 

1 

72 

76 

Ml. 

Fi.\o<l,  10,  000  Users 

;i0 

190 

220 

■1. 

(IjU'H 

Both 

25 

1*1 

()|ion 

9 

11 

20 

2:1. 

(  IjH'H 

a 

S 

12 

:  1. 

( IjK-n 

200 

IMOO 

2000 

‘St. 

<  »|)on 

7 

73 

80 

M'. 

OlH‘n 

170 

680 

850 

^7. 

OjM'n 

100 

400 

500 

'N. 

Fixed,  lOO  Users 

0 

103 

103 

0|ren 

2% 

98% 

? 

;<i. 

(>l»en 

Both 

195 

n. 

OjKlI 

Both 

2800 

Fixed,  r)  Usi'rs 

0 

5 

5 

10 

190 

200 

*  i 

(l]h'  I 

0.5 

65 

130 

1  r 

OlM’H 

Doth 

700  Users 

(225  TssM 

Ui 

OjHil 

:t2 

293 

325 

•T 

<>lK’n 

Both 

Non-Operatioi 

(>lM'n 

Both 

? 

OlMll 

“t 

95 

100 

oeciion  u.  uiKjr  (Cont. ) 


3.  Number  of 
eligible  users 

4.  Profiles  of  Individuals  (I), 
related  by  discipline  (GD), 
Administration  (GA) 
Maioritv 

1. 

2000 

1 

2. 

1000 

I 

a. 

.'iO  Projects 

GT 

•i. 

Open 

G  Industry 

aoo 

I 

i  , 

700 

I 

.5000 

I 

i>. 

800 

I 

10. 

Op«'n 

I.  GD.  GA 

11. 

MS  &  Up 

CD 

12. 

4000 

T 

i:i. 

2000 

I.  GD.  GA 

M. 

500 

I,  GD,  GT.  GA 

iri. 

400  (40  for  pilot) 

I 

ifi. 

1100 

I 

17. 

GOO 

GT 

IH. 

2500 

I 

10. 

000 

I 

20. 

500  (dele  rmi  nod 

I 

21. 

by  budget  limit) 

21. 

Open 

I 

22. 

GOO 

I 

22. 

000 

1 

21. 

2400 

GD  lit  GA  (250  Profiles) 

25. 

775 

I 

20. 

Open 

I.  CD 

27. 

0|H)n 

GOli  GA 

2K. 

Open 

I 

20. 

? 

1.  GO 

ao. 

1500 

I 

:il. 

? 

1 

:i2. 

Open 

I 

aa. 

3000 

I 

ni. 

130 

1.  CD,  GA.  or 

ar*. 

700  (225  Tusks) 

GT 

.10. 

? 

I 

.17. 

GOO 

I 

38. 

7 

1 

ao. 

.3000 

1 

Groiqis 
task  (GT). 

Other _ 

GD,  GT,  GA 

GD 


GD 

GD 

GD,  GAG  Industry 

I  (50  Profiles) 

GT 

CD 

GD.  GT 
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r^i'ciion  111.  uiiiciiut 


I.  KxU'rnal  Siiurci*  ln|)ul 


1. 

DIX: 

•  » 

•  »* 

NASAASCAN 

Nono 

1. 

None 

•  ’>. 

CluMii  AIxsinu’Ls 

V. 

NASA 

ItUiM'ital  Author 

!*. 

l)l)(' 

10. 

('onl  raclor 

II. 

Noui' 

i:;. 

Noiu* 

1  ** . 

Nj)m‘ 

1 1. 

MKDIAIIS 

ir,. 

C\»iUraclor 

10. 

None 

17. 

None 

IN. 

ln(<M*nal  Author 

n». 

('hom.  AIjsl  rafts 

'0. 

Stnvral 

■1, 

N<»ih' 

'  1  » 

KnuiiU’f  riun  ItuU'x 

-  •  *  • 

t’hfoi.  Ahsirafts 

1  . 

DIX' 

i , 

Insi.  I'or.Sfi.  Im'o. 

*  1 1 , 

\OIK* 

• 

Noiu* 

*  S* 

Noiu* 

. 

Noiu* 

::ci. 

Nom* 

;:i. 

Sovrral 

l>ialM>u*u  lit.  I«‘ lex 

I)IH\  (uiv.  I'riiiiing 

Noru* 

Noiu* 

DIW  1  173  Korm 

*  • . 

Noiu* 

,  H 

Nom* 

M  KOI  ..MLS 

V  of  document  Input 
K.xtornni  Internal 


100% 

0% 

100% 

0% 

0% 

100% 

O'- 

100% 

100% 

0% 

15% 

85%, 

lOO': 

0% 

lO*' 

90% 

100% 

0% 

0'/?, 

100% 

t»% 

100% 

0*,^. 

100% 

100% 

0% 

100';,', 

0% 

0% 

100% 

o';i 

100% 

12% 

88% 

100% 

0% 

100',' 

0%, 

1>% 

100% 

100', 

0%', 

100% 

O'? 

10':, 

90'.? 

100% 

O'? 

O'l 

lOO'l 

O'. 

100% 

0*. 

100% 

O'l. 

100', 

Oi. 

lOO'i', 

:io% 

70*% 

100% 

0% 

00*4 

m 

O', 

lOO'f 

O',- 

100% 

OS'? 

O'; 

lOO'i 

o;. 

lOO'i 

100', 

O'? 

2.  Input  processing 
time  per  document 


N/A 
N/A 
25  Min. 

2  Min. 

N/A 

45  Min. 

5  Min. 

38  Min. 

N/A 

67  Min. 

? 

N/A  -  Staff  Decision 
N/A 

20  Min. 

25  Min. 

10  Min. 

6. 5  Hours 

N/A 

N/A 

6  .Min. 

N/A 

N/A 

14  Min. 

N/A 
90  Min. 

90  Min. 

8  Min. 

? 

21  Min. 

3  Min. 

N/A 

13  Min. 

? 

N/A 
40  Min. 

23  Min. 

N/A,  Staff  Decision 
N/A 
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•1.  Percent  accepted 


:i.  DtH-unuMits/mo. 

reviewed  for  SI)I  input 


1. 

.5000 

100 

2. 

BOO 

100 

:t. 

1200 

7r> 

1, 

7200 

10 

t». 

500 

100 

7. 

8:to 

90 

25 

100 

!». 

110 

? 

10. 

9000 

100 

11. 

2200 

? 

12. 

3ao 

85 

i:i. 

16,000 

9S 

11. 

8000  per  tape 

100 

If). 

500 

100 

10. 

250 

? 

17. 

500 

98 

IH. 

1250 

4 

1». 

500 

100 

20. 

29,000 

100 

21. 

? 

96 

500 

100 

•1*1 

500 

100 

21. 

aoo 

15 

25. 

? 

80 

20. 

5500 

97 

27. 

5500 

97 

2S. 

100 

90 

2!» 

6500 

1000 

;ui. 

HtO 

80 

:il. 

aooo 

50 

;i2. 

400 

100 

XI. 

2500 

15 

:ii, 

1200 

90 

a  5. 

500 

25 

ao. 

2aoo 

85 

a7. 

40*; 

80 

a  8. 

? 

7 

ao. 

400 

100 

S^ection  m.  Documents  (Cont. ) 


r.. 

Vwabulury  Fm?, 

CnnlrolliHl  (C) 
and  Hourco  if  C 

If  Controlled 
number  of 
terms 

Number  of 

terms 

added 

6,  7  Indexing 
Lower  Unwr 
limit  limit 

Aver- 

age 

1. 

Subacribers  N/A 

2. 

Subacribers  N/A 

C. ,  Int.  like  EJC 

4700 

9/1000  doc. 

- 

“ 

20 

4. 

Manual  Scanning,  n/A 

«. 

Subscribes,  N/A 

7. 

Free 

8. 

C. ,  Internal 

ISO 

? 

- 

- 

3 

9. 

C.  •  Ext. ,  4  thesauri 

? 

- 

- 

8 

10. 

Contractor,  N/A 

11. 

C. ,  Int.  (1200  drugs) 

1800 

10/wk. 

1 

10 

6 

12. 

C. ,  Internal 

1500 

4/wk. 

3 

48 

7 

12. 

Manual  Scanning,  N/A 

14. 

Subscribes,  N/A 

Ifi. 

Controlled 

3000 

? 

- 

- 

15 

Ki. 

C. ,  Internal,  1000  idenifiers 

3000 

3/wk. 

- 

116 

13 

2000  subjects 

17. 

Free 

200 

7 

18. 

C. ,  Internal 

3300 

0 

1 

k5 

20 

19. 

Subscribes,  N/A 

20. 

Free 

- 

titles, 

abstracts 

texts 

21. 

Controlled 

2000 

? 

- 

225 

7 

22. 

Subscribes,  N/A 

22. 

Subscribes,  N/A 

24. 

C. ,  Internal 

? 

? 

8 

14 

10 

25. 

Subscribes,  N/A 

26. 

C. ,  Internal 

18,000 

7 

- 

- 

13 

27. 

C. ,  Internal 

18,000 

7 

"3 

2H. 

C. ,  Intciiuil 

15,000 

40/wk. 

- 

- 

4.3 

2U. 

C. ,  Internal 

15,000 

4/wk, 

2 

15 

8 

20. 

C. ,  Intemal  »  DDC 

2500  ^  DOC 

2/wk. 

- 

- 

10 

ai. 

Free 

text 

.12. 

Subacribes,  N/A 

aa. 

C. .  Internal 

14,000 

7/mo. 

3 

8 

5 

.11. 

C. ,  'idt'mal 

70  catc|p>ries 

7 

- 

N/A 

a5. 

C. ,  Intemal 

3000 

7 

- 

- 

4  *  aaod> 

i&ers 

ao. 

C. ,  Internal 

13,500 

180/yr. 

1 

100 

14 

.17. 

C. ,  Int.  like  AlClIS 

8000 

40/mo. 

- 

- 

24 

(stored 

of  40) 

.18. 

Manual  Scanning,  N/A 

39. 

C. ,  Internal 

7000 

SO/mo. 

- 

30 

H.  Number  of  one-time  urlieleH 
per  month 


9.  Number  of  journals 
per  month 


10.  Use 

Contractors 


Internal 

External 

Internal 

External 

X. 

N/A 

2. 

N/A 

a. 

1200 

0 

0 

0 

no 

■1. 

0 

300 

0 

6840 

no 

f). 

N/A 

7. 

0 

822 

0 

8 

no 

8. 

25 

0 

0 

0 

no 

9. 

0 

11 

1 

98 

yes  (pilot  test) 

10. 

N/A 

yes 

11. 

0 

0 

0 

2200 

no 

12. 

8 

157 

16 

149 

yes 

la. 

0 

10,000 

0 

0 

no 

14. 

N/A 

15. 

0 

0 

0 

500 

yes 

1«. 

250 

0 

0 

0 

no 

17. 

0 

0 

0 

500 

no 

18. 

20 

8 

0 

19 

no 

19. 

N/A 

20. 

0 

0 

0 

30.  000 

no 

21. 

all 

no 

22. 

N/A 

23. 

N/A 

24. 

00 

15 

0 

225 

no 

25. 

N/A 

20. 

1417 

4083 

0 

0 

yes 

27. 

1417 

408.3 

0 

0 

yes 

28. 

0 

0 

0 

100 

no 

29. 

? 

? 

? 

? 

yes 

ao. 

lOH 

479 

0 

193 

no 

31. 

1440 

1440 

15 

92 

no 

32. 

N/A 

33. 

0 

1750 

0 

750 

no 

34. 

0 

0 

5 

1195 

no 

35. 

80 

420 

0 

0 

no 

30. 

345 

1955 

0 

0 

no 

37. 

400 

0 

0 

0 

no 

38. 

some 

some 

most 

no 

39. 

200 

140 

0 

0 

no 
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Section  IV.  Profiles 


1. 

Profile  voe:il»ul;irv 
const  ruet  ion 

2.  Number  of  J’rofiles 
nia.ximum  In  System 

allowed 

3.  Limit  of  terms 
Lower  Upper 

Averag 

1. 

l)l)(’  voenl. 

1 

4000 

24 

i 

•» 

NASA /SPAN  list 

N/A 

Serves 

189  topics 

200 

' 

:i. 

Inlenuil  voe;ii). 

Usually  i 

10 

- 

- 

: 

i. 

Krec'  l;in};u;i^e 

onjy  i  neces¬ 

12 

- 

- 

. 

sary 

. 

Pn-e  lan^iinjje 

open 

30 

- 

- 

r’ 

|.’|•(‘e,  staff  ;n(l 

open 

114 

- 

- 

s. 

:!  ini.  'I'he.saiii’i 

1 

800 

- 

- 

it. 

1  (‘.\t.  llu'saui'i 

.> 

30 

- 

- 

10. 

I.  1)(K’.  ine.  vocal). 

open 

? 

1 

150 

49 

:J.  Subject  list 

N/A 

? 

- 

- 

II. 

Pl'C'C 

1 

? 

- 

- 

i:i. 

Inlcrnal  vocal). 

3 

1500 

- 

- 

i:i. 

pj’ce 

1 

159 

- 

- 

1 1. 

MKSII  (nn'dicalj 

1 

38 

- 

400 

10. 

('onl  I'olied  c!ilej;ofi«'s  uml  terms 

experimental 

? 

- 

7  diff. 

phrases 

1(1. 

Pi'i'c,  translated  by  .staff 

only  1  neces¬ 

260 

- 

1000 

sary 

17. 

PriH'.  slaff  aid 

many  cncour- 

386 

rarely 

aged,  open 

ceeds 

IS. 

Inlci'nal,  siaff  aid 

1 

2000 

1 

20 

lit. 

I'l’cc,  staff  aid 

open 

240 

- 

- 

'0. 

Prcc 

u;) 

220 

- 

9999 

130 

■1. 

p,-.-. 

:)(i 

(50 

1 

60 

I■!n”l...  w  .  .  Indc.x 

iOO 

j  • 

- 

- 

Prci' 

20 

•12 

- 

- 

■■  1. 

Inlcrnal  vocab. 

(i 

300 

- 

- 

■'.1, 

Prc)’,  sl.iff  aid 

open 

250 

1 

1000 

i  ‘t  I . 

InUM'nal  vocab. 

usually  1 

.850 

- 

- 

. .  t . 

Ini..  NASA  SCAN  list 

N/A 

Borves  .500 

- 

189  categorios 

■  *  s 

1  sc...  resiMia'b  j;ranl 

1 

103 

- 

- 

18 

|ni«'  rnal 

usually  1 

2lh 

- 

- 

::ii 

Inlcrnal  DIH' 

o|>on 

195 

- 

300 

.  ‘  i . 

Pi't  c  .siall  .tid 

1 

2. '(00 

- 

- 

.MK.sil  siafi  ai)l 

open 

11 

1 

200 

80.4 

..... 

Inlcrnal.  .slaff  aid 

op  cn 

700 

1 

8 

4 

.;  1 . 

Inlcrnal  Il’.M'.SCAN 

N/A 

•> 

- 

70 

..... 

InU'i'iial.  slaff  aid 

;;  if  necessary 

225 

- 

- 

5 

lnlcrii.il,  sl.iff  aid 

oiKm 

44(i 

- 

- 

13 

. 

lin.'i'ii.d,  slaff  aid' 

o|H;n 

^  ♦ 

5 

125 

37 


:iH.  KriH*  (iiil.v  I  ncfossary  ?  -  - 

.’li).  Inlornal,  hImIT  aid  i  100  -  >10 


'*  In  addition  user  submits  1  page  job  and  duties  summary  that,  staff  index  and  add  to  profile 


+  ♦  llcady  to  run,  but  non-opcrational.  If  system  were  to  go  on-line,  200  profiles  would  be  used. 
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IV.  I’rol'ilc.s  (Coni.) 


I.  Uso  Wri;;ltl.s  «>!•  logic 


r>.  Can  usi^r  gel  auto-  G.  Docs  user  suggest 
malic  distrib.  of  document  sources  ? 

named  ilcr.s? 


1. 

i  1 1  >  1 1  t 

no 

no 

•  1 

11(1 

no 

no 

• » 

links,  roles 

yes 

yes 

I . 

iui,  iinncccssarv.  manual 

yes 

yes 

li  . 

no 

no  (done  outside  system) 

no 

.inn.  or.  not 

no  (done  outside  system) 

1  -fi 

s. 

iiiornrchal  .ilpha  taxic 

yes 

yes 

;t. 

n  ;o  0.1)'.)!)  (0  is  nog. ) 

yes 

yes 

0. 

(1  )  -1)  lo  i!)  (2)  no 

no 

no 

1. 

no 

yes 

yes 

• » 

(1)  .scale  options  (2)  anti,  or,  n<«t 

yes 

yes 

•  > 

no.  uniieet'H.sarv.  manual 

yes 

yes 

1. 

hoiilean  ihrt'shoUl  !0  exactly 

no 

no 

•  *. 

0.  1.2  Tor  ile.seriplor.s 

i  retiueiu'v  <•01110  lor  calogoricH 

yes 

no 

(  . 

anil.  <ir.  not.  Iiierarchal,  pro))al>Uslic  or 

yes 

no 

1 , 

1  to  ID,  boolean  o|H'raio)‘s,  I'txg  matching 

yes 

yes 

nia\.  1  "s 

no 

no 

ami.  or,  not:  option  1  to  lOOO 

yes 

no 

0, 

1).  0001  i<i  (I.!)!)!)!)  nug.  r<K>i  matching 

yes 

yes 

:  • 

n  lo  il* 

yes 

yes 

' 

no 

yes 

yes 

no 

.ves 

yos 

I  to  It  ami  -  I  aiitl  root  matching 

yes 

no 

ih  1 

yes 

no 

.;',lil.s  .«*;iiini:itsii.,  i.Kiolean  logic 

yes 

yes 

\<.n.;li)s  .siiiHiialiiig  IxMileat)  logic 

yes 

yes 

o.  r.ii’chal  ili  con.il 

no 

no 
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